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ABSTRACT 


Certain  properties  of  multi-input,  multi -output  dynamic  systems 
which  are  of  interest  in  automatic  control  are  investigated.  It  is 
shown  that  such  systems  can  be  viewed  as  a  mapping  whose  domain  is 
the  set  of  possible  input  functions  and  whose  range  is  either  the 
set  of  responses  or  a  set  of  equivalence  classes.  By  selecting 
the  norms  for  the  input  and  output  spaces  in  various  ways  it  is 
possible  to  interpret  many  of  the  familiar  properties  of  a  system 
in  terms  of  these  mapp.  ..gs  and  new  means  of  system  characterization 
are  suggested  as  well.  It  is  shown  that  the  study  of  these  mappings 
leads  naturally  to  the  study  of  an  inverse  equation.  Conditions 
under  which  the  inverse  equation  exists  are  derived  for  some  linear 
and  nonlinear  systems  and  explicit  representations  for  the  inverse 
equation  are  given  for  certain  classes  of  linear  systems. 

The  effect  of  feedback  is  analyzed  in  terms  of  its  influence 
on  the  character  of  these  mappings  and  certain  limitations  on  feed¬ 
back  as  a  device  for  altering  a  given  system  are  noted.  It  is  also 
shown  that  the  inverse  equation  can  be  used  to  help  define  the 
optimum  input  for  a  certain  type  of  time-optimal  problems  where  the 
objective  is  stated  in  terms  of  the  outputs  rather  than  the  state. 
Examples  are  given  which  show  that  the  time-optimal  forcing  function 
cannot  always  be  generated  by  an  ideal  relay. 
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CHAPTER  I 


INTRODUCTION 

1.1  The  General  Problem 

A  basic  property  of  those  systems  which  are  of  interest  to  auto- 
matic  control  engineers  is  that  their  outputs  can  be  altered  to  meet 
changing  requirements  by  means  of  input  manipulation.  It  is  also 
true,  however ,  that  for  most  systems  there  are  definite  limitations 
on  the  types  of  responses  that  can  be  obtained.  For  example,  it  is 
usually  impossible  to  make  a  physical  variable  change  its  value  in¬ 
stantaneously.  In  general,  the  types  of  responses  that  can  be  ob¬ 
tained  from  a  system  depend  on  the  class  of  possible  inputs,  the 
present  state  of  the  system,  and  the  nature  of  the  equations  govern¬ 
ing  the  system.  The  purpose  of  this  research  is  to  study  the  nature 
of  these  dependencies,  to  characterize  certain  classes  of  systems 
which  have  the  property  of  being  able  to  generate  all  responses  in  a 
given  set,  and  to  indicate  certain  applications  of  these  ideas. 

The  systems  which  we  will  consider  are  all  special  cases  of 
those  which  can  be  described  by  a  pair  of  equations  of  the  form 

Z(t)  =  F(Z(t),X(t))  (1.1a) 

I(t)  =  G(Z(t),X(t))  (1.1b) 

The  symbol  X(t)  denotes  the  value  of  the  input  at  time  t  and  Y(t) 
denotes  the  corresponding  value  of  the  output.  The  vector  Z(t), 
which  is  usually  called  the  state,  is  not  assumed  to  bear  any  partic¬ 
ular  relationship  to  the  input  or  output  variables.  It  should  be 
thought  of  as  being  introduced  only  to  enable  the  first  order 
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representation  of  the  differential  equation.  The  inputs  and  out¬ 
puts  are  permitted  to  be  vector  valued  and  initially  are  not  assumed 
to  be  of  the  same  dimension  although  this  will  be  by  far  the  most 
important  case  for  our  work. 

If  we  are  interested  in  the  behavior  of  the  system  (l.l)  over 
a  specific  interval  of  time.,  say  [0,0].,  then  it  is  not  the  value  of 
the  variable  X(t)  at  any  one  time  which  determines  the  response  but 
rather  it  is  the  behavior  of  X(t)  over  the  entire  ii  ;erval.  To 
avoid  confusion  it  is  imparative  to  distinguish  between  a  function* 
that  is*  a  complete  description  of  how  the  variable  varies  with 
time,  and  the  value  of  a  function  at  some  particular  time  t.  As  is 
common  in  mathematical  literature  we  will  use  symbols  such  as  X,  Y, 
and  Z  without  arguments  to  denote  functions  and  use  X(t),  Y(t),  and 
Z(t)  to  denote  the  value  of  a  function  at  time  t. 

In  terms  of  this  notation  the  problem  under  consideration  can 
be  given  a  more  precise  statement.  Let  denote  a  set  of  functions 
defined  on  the  interval  [o,q] .  Assume  that  for  each  XeS^.  the 
equation  (l.la)  has  a  unique  solution  passing  through  a  given  in¬ 
itial  point  Z(0)  and  assume  that  this  solution  is  defined  over  the 
entire  interval  [o,qj.  This  fact  may  be  expressed  by  saying  that 
the  system  (1.1)  associates  a  unique  Y  with  each  X  in  or  by  say¬ 
ing  that  (1.1)  defines  a  (single  valued)  mapping  whose  domain  is  S^. 
The  problem  we  are  studying,  that  is,  the  problem  of  determining  the 
types  of  responses  the  system  is  capable  of,  is  identical  to  that  of 

The  symbol  e  should  be  read  "belonging  to". 
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determining  the  range  of  this  mapping. 

From  an  abstract  point  of  view  our  approach  will  be  to  introduce 
norms  in  the  input  and  output  spaces  and  then  to  discuss  the  con¬ 
ditions  under  which  certain  transformations  are  continuous  and 
invertible.  The  conditions  under  which  a  transformation  has  a  con¬ 
tinuous  inverse  will  be  particularly  important  since  this  corres¬ 
ponds  to  the  case  where  a  more  or  less  arbitrary  output  can  be 
reali zed. 

Because  our  approach  and  methods  are  somewhat  different  from 
those  currently  in  vogue  in  control  systems  theory  an  effort  has 
been  made  to  relate  the  properties  under  consideration  to  the  more 
familiar  concepts  of  stability*  minimum  phasity*  controllability* 
singularity  of  transfer  matrices*  etc.  The  remainder  of  this 
chapter  is  devoted  to  a  somewhat  informal  explanation  of  the 
approach  to  be  taken.  It  includes  a  brief  account  of  our  results 
as  well  as  some  of  the  results  of  other  workers  who  have  made 
similar  investigations  or  have  otherwise  contributed  to  this  study. 

Although  this  thesis  represents  an  engineering  study  it  has 
been  necessary  to  use  some  mathematical  terminology  in  order  to 
make  the  intuitive  ideas  precise.  This  is  particularly  true  in 
Chapter  II  where  the  objective  is  to  develop  the  essential  points 
in  such  a  way  as  to  make  them  acceptable  and  understandable  to  both 
control  engineers  and  applied  mathematicians, 

1 . 2  Systems  as  Transformations 

As  indicated  above*  once  Z(0)  is  fixed  the  system  (l.l)  can 

be  viewed  as  a  transformation  whose  domain  is  a  set  of  functions  S  . 

x 
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For  many  purposes  it  is  most  natural  to  regard  the  range  of  such  a 
transformation  as  being  the  set  of  response  functions,  i.e.,  the 
set  S  consisting  of  all  the  functions  Y  which  can  be  generated  by 

y 

an  X  in  S  .  If  this  point  of  view  is  adopted  then  one  way  of  de¬ 
termining  what  types  of  responses  are  possible  is  to  construct  a 
representation  of  the  inverse  mapping.  For  example,  if  X  and  Y  are 
related  by  a  system  of  the  form 

Z(t)  =  AZ(t)  +  BX(t)  (1.2a) 

Y(t)  *=  CZ(t)  +  DX(t)  (1.2b) 

and  if  D  is  nonsingular,  then  it  is  possible  to  ''solve"  (1.2)  for 
X(t)  in  terms  of  Y(t)  and  Z(t),  That  is,  we  can  eliminate  X(t)  from 
the  differential  equation  by  expressing  it  in  terms  of  Y(t)  and  Z(t). 
For  the  system  (1.2)  the  inverse  system  is  given  by 

Z(t)  =  (A-BD-1^Z(t)  +  BD_1Y(t)  (1.5a) 

X(t)  =  D-1Y(t)  -  D-1 CZ(t)  (1.3b) 

Both  (1.2.)  and  (1.3)  provide  complete  descriptions  of  the  re¬ 
lationship  between  X  and  Y.  The  important  difference  is  that  the 
representation  (1,2)  places  in  evidence  the  differential  equation 
relating  Z  to  X  whereas  (1.3)  places  in  evidence  the  differential 
equation  relating  Z  to  Y.  Since,  X  is  the  independent  variable  the 
representation  afforded  by  (1.2)  is  more  natural  in  most  cases  and 
hence  we  will  take  it  to  be  the  definitive  relationship  and  will 
regard  (1.3)  as  the  inverse  transformation. 

The  importance  of  the  inverse  system  is  twofold.  First,  the 
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very  fact  that  it  is  possible  to  derive  such  a  relationship  implies 
that  for  any  Y  such  that  (l„3)  has  a  unique  solution  there  exists 
an  X  which  generated  it  and  in  addition  (1,3)  gives  a  means  for  com¬ 
puting  it.  Second^  the  representation  (1.3)  allows  one  to  study 
how  X  varies  as  a  function  of  Y.  This  is  particularly  important 
in  cases  where  the  systems  of  equations  cannot  be  solved  and 
qualitative  methods  must  be  used. 

In  view  of  the  large  amount  of  work  which  has  been  done  on 
Liapunov  stability  it  is  natural  to  ask  how  instability  manifests 
itself  if  one  adopts  the  proposed  point  of  view.  In  particular; 
what  is  the  significance  of  an  unstable  inverse  equation? 

In  order  to  answer  these  questions  satisfactorily  it  is 
necessary  to  examine  the  concept  of  continuity  as  it  applies  to 
mappings  between  function  spaces, 

1,3  Continuity 

Loosely  speaking  what  we  have  in  mind  is  this.  If  the  eigen¬ 
values  of  A  have  negative  real  parts,  and  if  X  is  a  particular 
function  which  maps  into  Y,  then  any  X'  which  is  "close"  to  X  maps 
into  a  function  Y’  which  is  "close"  to  Y.  This  is  true  even  if  we 
are  interested  in  the  behavior  of  Y(t)  over  the  entire  interval 
[o,oo]  ,  It  is  not  necessarily  true,  however,  that  any  Y!  which  is 
"close"  to  Y  can  be  generated  by  an  X’  which  is  "close"  to  the  X 
which  generated  Y.  This  is  true  only  if  the  eigenvalues  of  the 
matrix  (A-BD  ’^C)  appearing  in  the  inverse  equation  have  negative 
real  parts. 
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At  this  point  these  statements  lack  precision  because  it  has 
not  been  specified  what  is  meant  by  ^close".  For  the  scalars  x(t) 
and  y(t),  the  absolute  value  of  the-  difference,  |x(t)-y(t)|,  is  a 
measure  of  closeness.  For  vectors  a  suitable  definition  is  the 
maximum  over  all  i  of  |x^(t)-y^(t) j  where  x^(t)  and  y^(t)  are  the 
i—  components  of  X(t)  and  Y(t)  respectively.  This  will  be 
written  as  |X(t)-Y(t)|.  For  vector  valued  functions  defined  over 
[o,cr]  a  measure  of  proximity  may  be  defined  as  the  maximum  over 
all  te  [o,  a]  of  [X(t)-Y(t)|.  This  is  denoted  by  ||X-Y||  or  simply 
by  | ] X— Y | |  if  the  interval  of  interest  is  [o,coJ.  A  function  such 
as  this  which  satisfies  certain  technical  requirements  is  known  as 
a  norm.  This  is  by  no  means  the  only  possible  way  of  defining  a 
norm  and  although  this  particular  norm  will  always  be  used  for  the 
input  space  it  will  be  necessary  to  consider  other  types  of  norms 
for  the  output  set  in  order  to  accurately  characterize  the  trans¬ 
formations  under  consideration. 

Return  now  to  the  system  (1.2).  Let  S  =  C,  the  class  of  con¬ 
tinuous  functions  defined  un  [0,00]],  and  suppose  the  above  defini¬ 
tion  of  norm  is  used.  Then,  as  will  be  shown  in  Chapter  II,  if  the 
eigenvalues  of  A  have  negative  real  parts  then  the  system  (1.2)  de¬ 
fines  a  continuous  mapping  of  C  into  C.  That  is  to  say,  if  the 
eigenvalues  of  A  have  negative  real  parts  then  for  any  given  e  >  0 
there  exists  a  6  >  0  such  that  if  | } X— X f f |  <6  then  | |Y-Y' [ |  <  e. 
Here  again  Y  is  the  image  of  X  and  Y'  is  the  image  of  X'.  The  con¬ 
dition  that  the  eigenvalues  of  A  have  negative  real  parts  is 
necessary  and  sufficient  for  the  asymptotic  stability  of  the 
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differential  equation  (l.2a)  and  thus  for  this  particular  trans¬ 
formation  asymptotic  stability  implies  continuity.  It  is  important 

19* 

and  useful  to  know  that  the  total  stability  theorem  implies  that 
this  is  also  true  for  a  wide  class  of  time -invariant  nonlinear 
systems,  provided  that  Z(0)  is  sufficiently  close  to  the  critical 
point  and  that  the  inputs  are  sufficiently  small. 

The  inverse  equation  (1.3)  defines  a  continuous  mapping  of  € 
into  C  if  the  eigenvalues  of  (A-BD  ^C)  have  negative  real  parts. 

If,  on  the  other  hand,  there  exist  eigenvalues  of  (A-BD  ^C)  with 
positive  real  parts  then  there  may  be  output  functions  which  are 
finite  but  which  require  an  infinite  input  to  produce  them.  In 
terms  of  the  approach  taken  here  such  a  system' would  be  described 
as  one  which  does  not  have  a  continuous  inverse. 

Another  reason  that  the  inverse  may  fail  to  be  continuous  in 
terms  of  the  characterization  of  the  response  space  given  above  is 
that  the  inverse  system  may  involve  derivatives  of  Y.  'Consider  the 
scalar  equation 

y(t)  +  by(t)  =  x(t)  (1.4) 

If  b  is  positive  then  when  viewed  as  a  mapping  of  S  into  S 

•A.  J 

(1.4)  is  continuous  provided  both  and  are  taken  to  be  C, 
Difficulties  arise ,  however,  when  considering  the  inverse  mapping. 
The  fact  is  that  by  making  [ |y-y"| |  small  one  cannot  insure  that 

’'Superscripts  refer  to  the  references  found  immediately  following 
Chapter  V, 
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the  inverse  images  of  y  and  y'  will  be  close.  Thus  (1,4)  does  not 


have  a  continuous  inverse  if  S  -  C, 

y 

The  problem  in  this  case  is  that  the  value  of'x(t)  depends  on 
y(t)  as  well  as  y(t)  whereas  a  restriction  in  the  value  of  | ] y— y ’ | | 
gives  no  control  over  the  value  of  [jy-y'||.  The  solution  to  this 
problem  is  to  define  a  norm  which  gives  control  over  both  |  j'y-y1 1  | 
and  | [y-y1 | | .  It  should  be  clear  that  in  more  general  situations 
control  over  higher  derivatives  may  be  required  as  well.  Let 
C  be  the  set  of  functions  defined  on  [o,coJ  which  have  k  con¬ 
tinuous  derivatives.  Define  |[X||k,  the  norm,  as  maximum  over 


all  i  from  zero  to  k  of  ||X^||  where  X^  is. the  i—  derivative 
of  X  with  respect  to  time.  In  terms  of  this  notation  equation 
(1.4)  defines  continuous  transformation  of  0  into  'C^  and  as  such, 
it  has  a  continuous  inverse.  The  ideas  which  we  have  sketched 
here  will  be  presented  in  a  more  orderly  fashion  in  Chapter  II. 


.th 


1. 4  Asymptotic  Controllability 

As  pointed  out  in  the  previous  section,  mappings  defined  by 
equations  such  as 

y(t)  +  y(t)  =  x(t)  -  x(t)  (1.5) 

do  not  have  continuous  inverses  because  their  inverse  equation  is 
unstable,  let,  such  systems  can  be  controlled  to  a  large  extent. 
For  example,  one  can  find  a  bounded  input  x  such  that  y(t)  is 
identical  to  some  desired  function  over  some  finite  interval  [o,  o J  . 
One  may  also  find  x(t)  such  that  y(t)  takes  on  a  given  value  at  a 
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given  time  or,  one  may  find  a  bounded  x(t)  such  that  y(t)  approaches 
any  desired  value  asymptotically.  Systems  having  this  last  pro¬ 
perty  are  especially  important  in  process  control.  Thus  the  fact 
that  a  system  does  not  have  a  continuous  inverse  does  not  imply  that 
the  system  cannot  be  controlled  at  all  but  rather,  it  merely  implies 
that  perfect  control  is  not  possible  over  the  entire  interval  [o,oq]  „ 
Although  we  will  not  attempt  to  define  exactly  what  the  capabi¬ 
lities  and  limitations  of  a  system  with  an  unstable  inverse  equation 
are,  we  will  study  the  conditions  under  which  steady-state  control 
is  possible.  If  a  system  can  be  controlled  in  the  steady-state  we 
will  call  it  asymptotically  controllable.  A  precise  definition 
will  be  given  in  (2.5).  In  what  follows  here,  we  will  try  to  give 
the  physical  motivation  for  that  definition. 

Since  any  reasonable  defiiLtion of  asymptotic  controllability  re¬ 
quires  that  the  limit  as  t  approaches  infinity  of  Y(t)  should 
exist,  it  is  clear  that  asymptotic  stability  of  the  differential 
equations  is  a  prerequisite.  It  is,  however,  not  sufficient  to 
insure  that  it  will  be  possible  to  find  an  X  such  that  X(t)  tends 
toward  a  preassigned  constant.  An  obvious  example  is  the  one- 
input,  two -output  system 

y-j^t)  +  y-^(t)  =  x(t)  (1.6a) 

y2(t)  +  2y2(t)  =  x(t) 

Although  these  equations  are  asymptotically  stable,  it  is  not 

■>  a  and  y^(t)  -->  b  as 
t  — >  oo  except  in  the  special  case  where  2a  K  b. 


possible  to  find  x  such  that  y.^(t) 


-10- 


At  the  same  time,  it  should  not  be  assumed  that  there  are  no 
one -input, two -out put  systems  which  are  capable  of  meeting  these 
requirements.  Consider  the  nonlinear  system 

y-|_(t)  =  (x(t))5  (1.7a) 

y^(t)  =  x(t)  (1.7b) 

A  little  experimenting  with  this  system ' convince s  one  that  by 
manipulating  the  single  innut  x(t)  it  is  possible  to  make  the  out¬ 
puts  assume  any  two  preassigned  steady-state  values.  This  behavior 
is  very  interesting  and  seems  to  be  present  in  most  systems  which 
incorporate  some  means  for  self-identification  or  adaption.  How¬ 
ever,  because  of  its  complexity,  it  will  not  be  discussed  here. 

From  a  more  formal  point  of  view  the  problem  of  determining 
the  conditions  under  which  it  is  possible  to  force  the  outputs  to  a 
preassigned  steady-state  value  may  be  put  into  our  general  frame¬ 
work  as  follows.  First  observe  that  we  are  only  interested  in  the 
limit  of  Y(t)  as  t  — >  oo  so  that  any  two  responses  such  that 

lim  (Y(t)  -  Y'(t))  =  0  are  equivalent.  In  fact,  this  is  an 
t  — >  oo  7 

equivalence  relation  in  the  precise  sense  and  as  such  it  partitions 

the  subset  of  C  for  which  lim  Y(t)  exists  into  equivalence 

t  — >  oo 

classes.  A  suitable  norm  for  this  set  of  equivalence  classes  is 

| lim  Y(t) |  =  | |Y| |  where  Y  represents  the  equivalence  class  con- 
t  — >  oo 

taining  Y(t) .  It  follows  that  the  conditions  under  which  it  is 
possible  to  achieve  steady-state  control  are  the  same  as  those 
which  guarantee  that  the  system  maps  C  onto  this  set  of  equivalence 


classes. 
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In  this  section,  and  those  preceding  it,  we  have  attempted  to 
provide  motivation  for  the  material  to  be  presented  in  Chapter  II. 
The  purpose  has  been  to  engender  a  point  of  view  which  may  be 
summarized  as  follows s  Once  the  initial  state  is  fixed  a  system  can 
be  viewed  as  a  mapping  from  a  function  space  S__  onto  a  second  set  S  . 
By  properly  selecting  the  norms  for  these  spaces  it  is  possible  to 
relate  many  of  the  important  problems  in  control  theory  to  the  study 
of  these  mappings. 


1.5  Background 

The  problem  of  determining  the  limitations  which  the  equations 
of  motion  impose  on  the  control  of  a  physical  process  has  been  studied 
from  many  points  of  view,  usually  with  the  term  controllable  being 

applied  to  systems  which  have  some  desired  property.  Thus  the  works 

49  12  23  3 

of  Smith*  ,  Eckman  ,  Kalman  ,  and  Antrosiewicz  ,  to  name  a  few, 

all  contain  different  definitions  of  controllability. 

In  his  study  of  multivariable  systems,  Mesarovic^5^1^"'' 

studied  and  quantified  the  concept  of  interaction  and  our  general 

approach  to  the  problem  of  determining  the  conditions  under  which 

a  multivariable  system  can  produce  a  desired  output  has  been  very 

much  influenced  by  his  work  in  this  area.  The  work  of  Kavanaugn^*^, 

Arrara^,  and  Freeman^45^^  should  also  be  mentioned  as  being  in  the 


same  spirit  as  this  investigation. 


Although  we  have  used  a  vector  differential  equation  in  re¬ 
presenting  the  equations  of  the  system  we  have  also  consistently 
placed  in  evidence  a  second  equation  relating  the  output  to  the 


-12- 


states.  This  is  often  not  done.  In  fact,  one  of  the  effects  of 
the  recent  trend  toward  modeling  systems  by  first  order  vector 
differential  equations  has  been  to  focus  attention  on  the  state, 
often  to  the  extent  of  leaving  the  outputs  undefined.  This  is 

evident,  for  example,  in  the  recent  work  of  Kalman,  Ho,  and 

£7  18  35  3 

Narandra  ,  Gilbert  ,  Marcus  and  Lee  ,  and  Antrosiewicz  ,  where 

controllability  is  defined  in  terms  of  the  state  rather  than  the 

outputs. 

To  emphasize  the  difference  between  the  point  of  view  taken  by 
this  last  group  of  authors  and  the  approach  taken  here,  consider  the 
following  definition  from  reference  27:  "A  system  is  controllable 
if  any  initial  state  can  be  transferred  to  any  desired  state  in  a 
finite  length  of  time  by  some  control  action.”  It  has  been  shown 
that  a  necessary  and  sufficient  condition  for  the  equation  (1.2a)  to 
be  controllable  is  that  the  matrix  (B,AB,  ...  Ap  ^TB)*  be  of  rank  p 
where  p  is  the  dimension  of  Z(t).  According  to  such  a  definition  the 
system  defined  by  (1.6)  is  controllable.  This  clearly  points  out 
that  controllability  does  not  imply  that  one  can  select  the  inputs 
so  as  to  obtain  an  arbitrary  response.  What  it  does  insure  is  that 
one  can  select  the  innuts  so  as  to  obtain  an  arbitrary  value  of  the 
state  (or  output)  at  some  one  point  in  time. 

For  those  cases  in  which  the  system  equations  are  of  the  form 
of  (1.2)  and'  D  is  zero  Kalman  has  shown  that  the  concept  of  ob¬ 
servability  has  certain  applications.  Without  attempting  a  detailed 

* - 

See  section  2.1  for  an  explanation  of  this  notation. 


explanation,  the  condition  under  which  a  system  of  the  form  (1.2) 
with  D  =  0  is  observable  is  that  (cF  ,A^(F , . .  ,A^  be  of  rank  p 
where  again  p  is  the  dimension  of  Z(t).  Since  the  system  (1,6)  can 
be  represented  as  a  controllable  and  observable  system  it  is  clear 
that  it  is  possible  to  have  systems  which  are  both  controllable  and 
observable  in  the  sense  of  Kalman  but  which  do  not  correspond  to 
systems  which  would,  for  example,  make  suitable  servomechanisms. 

Similar  comments  apply  to  most  of  the  other  definitions  of  con¬ 
trollability  which  have  been  proposed  in  connection  with  optimal 

control  problems.  The  terms  "accessibility"  and  "reachable"  which 
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appear  in  the  papers  of  Hermann  and  Roxin  also  fall  into  this 
class. 


The  various  properties  of  the  solutions  of  differential  equa¬ 
tions  which  are  used  in  Chapter  II  may  be  found  in  any  of  the 


standard  texts 


10,33,42 


Particular  facts  which  will  be  required 


include  the  usual  existence  and  uniqueness  theorems  and  the  fact 
that  if  the  right  hand  side  has  continuous  partial  derivatives  with 
respect  to  the  state  vector  then  the  solutions  are  continuous 
functions  of  the  initial  state.  We  also  use  the  fact  that  the 
solution  of  the  linear,  time -invariant  differential  equation 


Z(t)  =  AZ(t)  +  BX(t) 


can  be  written  as 


Z(t)  =  eAtZ(0)  +  eAt  rVAsBX(s)ds 

4  rs 


(1.9) 
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T'he  idea  of  looking  at  a  nonhomogeneous  differential  equation 

as  defining  a  mapping  between  function  spaces  is  not  discussed  in 

37 

the  above  references.  It  has  been  used  by  Massera  and  Schaffer  , 

for  example,  in  their  study  of  stability,  however,  not  with 'the  same 

norms  or  with  the  same  objective.  The  work  in  section  2.4  depends 

strongly  on  lemma  2  which  is  closely  related  to  certain  results  on 

total  stability^ 5 ^ .  Our  proof  of  this  lemma  is  based  on 

33 

Lefschetz's  proof  of  Dychman's  theorem  . 

1.6  Summary  of  Results 

In  Chapter  II  the  conditions  under  which  a  linear,  time-invariant 
system  such  as  (1.2)  has  an  inverse  is  derived.  Explicit  formulas 
are  given  for  the  inverses  in  a  number  of  special  cases,  including 
the  most  general  single -input ,  single-output  system.  One  general 
result  concerning  n-input,  n-output  systems  of  the  form  of  (1,2)  is 
given  by  theorem  3  which  asserts  that  if  (D,CB,CAB, . . ,CAP  )  is  of 
rank  n  then  it  will  be  possible  to  solve  (1.2)  for  X  in  terms  of  Y 
and  its  derivatives.  It  should  be  noted  that  only  in  the  case 
where  n  =  1  is  this  condition  implied  by  the  controllability  and 
observability  conditions  cited  previously. 

If  the  properties  derived  for  linear  systems  are  to  be  useful 
it  is  important  to  show  that  small  nonlinearities  do  not  disturb 
them.  Some  results  in  this  direction  are  contained  in  section  2.4 
where  it  is  shown  that  in  the  neighborhood  of  zero  conclusions  about 
systems  of  the  form 
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(1.10a) 

(1.10b) 


Z(t)  =  AZ(t)  +  BX(t)  +  BQ(X(t),Z(t)) 

Y(t)  =  CZ(t) 

can  be  derived  from  an  examination  of  the  linear  part  provided 
Q(X(t)jZ(t))  satisfies  certain  smoothness  requirements  and  contains 
no  linear  terms. 

Insofar  as  steady-state  control  is  concerned  our  principle  re¬ 
sult  is  that  if  CA  +  D  singular  then  the  system  (1.2)  is  not 
asymptotically  controllable.  The  final  section  of  Chapter  II  in¬ 
terprets  the  previous  results  in  terms  of  transfer  matrices. 

In  Chapter  III  the  effects  of  feedback  are  analyzed.  It  is 
shown  that  if  a  system  is  linear  then  under  mild'  restrictions 
linear  derivative  feedback  does  not  affect  irreducibility,  non¬ 
singularity  3  or  asymptotic  controllability.  It  is  also  shown  that 
nonlinear  feedback  of  a  certain  type  does  not  affect  the  stability 
of  the  inverse  equation.  The  problem  of  absolute  stability  is 
examined  and  it  is  shown  that  stability  of  the  inverse  equation  is 
a  requirement  for  absolute  stability. 

An  additional  use  for  the  inverse  equation  is  developed  in 
Chapter  IV  where  some  nonstandard  time -optimal  control  problems 
are  discussed  and  solved.  It  is  shown  how  the  structure  of  the 
time -optimal  input  depends  on  the  inverse  equation  and  how  the  in¬ 
verse  equation  can  be  used  to  obtain  a  parametric  representation  for 
the  optimum  input. 


CHAPTER  II 


TRANSFORMATIONS  DEFINED  BY  DIFFERENTIA!  EQUATIONS 
2,1  Introduction 

Our  objective  is  to  study  a  class  of  transformations  defined 
by  nonhomogeneous  differential  equations.  The  basic  idea  is  the 
following.  Assume  we  are  given  a  first  order  vector  differential 
equation 

Z(t)  =  F(X(t),Z(t))  (2.1a) 

and  a  function  Y(t)  defined  in  terms  of  Z(t)  and  X(t) 

Y(t)  -G(X(t),Z(t))  (2.1b) 

If  for  a  given  value  of  Z(0),  the  differential  equation  has  a 

unique  solution  for  all  functions  in  a  given  class  Sx  then  the 

system  (2.1)  maps  this  class  of  functions  into  a  second  set  S  .  By 

Y 

introducing  norms  in  these  function  spaces  it  is  possible  to  study 
the  continuity  and  invertibility  of  these  mappings  in  a  systematic 
way.  It  may  be  observed  that  ordinary  point-transformations  of  the 
type  Y(t)  =  G(X(t) )  can  be  thought  of  as  being  a  special  case  and 
are  perhaps  the  simplest  examples  of  the  type  of  transformations  we 
are  considering.  The  motivation  for  this  study  can  be  found  in 
later  chapters  where  applications  to  various  problems  in  automatic 
control  are  discussed. 

Lower  case  letters  refer  to  scalars,  upper  case  to  vectors  and 

T 

matrices.  If  A  is  an  arbitrary  vector  or  matrix  then  A  is  its 
transpose,  if  A  is  square  and  nonsingular  A  ^  denotes  its  inverse. 
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The  1—  element  of  a  vector  X  will  be  written  as  x^,  the  ij— 

element  of  a  matrix  A  will  be  written  as  a.  Occasionally  A  will 

"L  J 

be  written  as  (a^..) .  If  A^,A^,...A^  are  a  set  of  matrices  or 
vectors  all  having  the  same  number  of  rows  then  (A^jAg, , .  .A^)  de¬ 
notes  the  matrix  consisting  of  all  the  columns  of  A-^Ag, . .  .A^. 

Thus 5  if  the  A^  are  p  by  p  matrices  then  the  matrix  (A-^jA^j . „ .A^) 
is  p  by  np„  For  scalars  [ a f  denotes  the  absolute  values,  for 
vectors  and  matrices  I A I  denotes  the  sum  of  the  absolute  values  of 
the  elements  of  A.  The  determinant  of  A  will  be  written  as  det  A. 

2.2  Function  Spaces 

Let  En  denote  Euclidean  n-space  and  let  a  and  I  denote  the 
intervals  [ojct]  and  [o,oo[]  respectively.  The  set  of  all  bounded 
continuous  functions  with  domain  a  and  range  E^  will  be  denoted  by 
Cn(  a) .  The  set  of  bounded  functions  with  domain  I  and  range  E^ 
which  are  continuous  on  every  finite  interval  contained  in  I  will 
be  denoted  by  C  .  For  the  function  space  U^ct)  we  use  the  norm 

I ]X| f  =  sup  |X(t)[  (2.2) 

CT  tea 


In  the  sequel  we  shall  always  assume  that  a  is  finite  and  nonzero 
without  specifying  it  each  time.  For  the  norm  is  defined  in  the 
same  way  and  written  as 


-18- 


functions  which  have  at  least  k  continuous  derivatives  on  a  and  let 

k  k  /  \  k 

G  be  the  corresponding  subset  of  C  .  If  XeC  (a)  or  C  we  denote 
n  16  n  nx—  n 

its  k—  derivative  by  For  our  purposes  it  is  indispensable 

to  have  norms  for  the  spaces  C^(o)  and  which  reflect  the  magni¬ 
tude  X  and  its  first  k  derivatives.  With  this  in  mind  we  define 

the  G^(ct)  norm  as 

n  — 

[[*[!*=  sup  f  rx(l)(t) I [  (2.4) 

°<i<k 

The  norm  for  is  written  as  f |x| and  is  given  by  (2.4)  with  cr 
replaced  by  I. 

To  see  that  (2,4)  actually  defines  a  norm  notice  that  i) 

| |X| is  nonnegative  and  vanishes  if  and  only  if  X(t)  =  0  on  a, 
ii)  [|aX|[k  =  a  •  ||X([^  and  iii)  [X^  (t)  -Y^  (t)  |  <  |X^(t)|  + 
|  y(i)  (t )  [  for  a2.i  i  and  t  and  hence  it  follows  that  the  triangle 
inequality  is  satisfied.  It  is  manifest  that  these  spaces  are 
linear  so  that  (2.4)  makes  C^(ct)  a  normed  linear  space.  When  we 
speak  of  the  continuity  of  a  mapping  between  two  such  spaces  it  is 
always  continuity  with  respect  to  the  norm  topology  which  we  have 
in  mind. 

2.3  Linear  Time -Invariant  Systems 

Let  X(t)  and  Y(t)  be  n-vectors*  let  Z(t)  be  a  p-vector*  and 


suppose 


Z(t)  =  AZ(t)  +  BX(t) 
X(t)  =  €Z(t) 


(2.5a) 

(2.5b) 
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where  A,  B,  and  'C  are  constant.  It  is  well  known  that  if  XeC  (cr) 
then  for  a  given  value  of  Z(0)  there  exists  a  unique  ZeC!^  satisfy¬ 
ing  (2,5a)  and  that  Z(t)  is  given  by 


Z(0)  +  /'teA^t_sW(  s)ds  (2.6) 

*C\ 


Therefore,  if  Z  and  Z>  are  the  images  of  X  and  X’  respectively 


then  ||Z-Z»|j  <  | [X-X' | |  .  cr|  |eA^B [  |'  .  Using  this  and  (2.5a)  it 

follows  that  f  |Z~Z»  f  f1  <  ((1  +  (At).  a||eAtB[|  +  |B[)  |  ]X-X«  |  j'  . 


Since  e  is  bounded  on  any  finite  interval  this  shows  that  for 
finite  a  (2.5a)  defines  a  continuous  mapping  of  €n(a)  into  €^(a) . 

A  sufficient  condition  for  (2,5a)  to  define  a  continuous 
mapping  of  into  is  that  the  eigenvalues  of  A  have  negative 
real  parts.  This  is  easily  established  from  the  fact  that  if  the 
eigenvalues  of  A  have  negative  real  parts  then  there  exists  a  and 
\  >  0  such  that  [ feA^B ( [  <  ae  This  together  with  (2.6)  gives 


|Z~Z«t|  <  |  fX-X*  t  |  •  t  \j^°  ae_^^t~S^ds|  [  (2.7) 


<  !  jX-X'l | aA 


From  this  and  (2.5a)  the  continuity  of  the  mapping  of  into 
is  easily  established. 

If  no  additional  restrictions  are  imposed  it  is  possible  that 
for  some  values  of  Z(0)  (2.5a)  may  define  a  continuous  mapping  of 
into  <£  even  though  A  has  eigenvalues  with  zero  or  positive  real 
parts.  To  eliminate  this  pathological  behavior  and  to  prevent 
similar  difficulties  from  arising  when  discussing  the  relationship 
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between  X  and  Y  we  now  assume :  i)  That  of  the  np  columns  in  the 

matrix  (BjAB,..,A^  B)  there  are  p  linearly  independent  ones,  and 

T  T  T  T  “p— I  T 

ii)  That  of  the  np  columns  in  the  matrix  (C  ,A  C  ,  ...A  y  C  )  there 

are  p  linearly  independent  ones. 

Systems  having  this  property  will  be  called  irredu cible .  The 
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role  of  these  assumptions  has  been  studied  by  Gilbert  and  Kalman 
and  it  has  been  shown  that  any  linear  time -invariant  system  has  an 
irreducible  representation.  If  (2.5b)  is  replaced  by  Y(t)  =  CZ(t)  + 
DX(t)  then  we  will  say  that  the  system  is  irreducible  if  the  corres¬ 
ponding  system  obtained  by  setting  D  =  0  is  irreducible.  For  our 
purpose  the  importance  of  irredu cibility  stems  from  the  following 
lemma  which  asserts  that  if  the  system  is  irreducible  then 
asymptotic  stability  and  continuity  are  equivalent. 

Lemma  1 :  Assume  that  the  system  (2.5)  is  irreducible.  Then;' 
i)  The  differential  equation  (2.5a)  defines  a  continuous  mapping 
of  the  input  function  space  into  the  state  function  space  if 
and  only  if  the  eigenvalues  of  A  have  negative  real  parts,  ii)  The 
system  (2.5)  defines  a  continuous  mapping  of  the  input  function 
space  into  the  output  function  space  C  if  and  only  if  the 
eigenvalues  of  A  have  negative  real  parts. 

Proofs  i)  The  sufficiency  of  the  condition  on  the  eigenvalues 
has  been  shown  above.  To  show  necessity  consider  first  the  mapping 


of  Cn  into  C^.  From  equation  (2.6)  it  follows  that  the  mapping 
will  not  be  continuous  unless  [e^Bf  tends  to  zero  with  increasing 
time.  Now  suppose  A  has  an  eigenvalue  with  a  nonnegative  real  part. 
This  implies  that  |e  |  does  not  approach  zero  as  t — xo  .  More- 
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overs  since  (B  ,,ABj  „ .  ,AP  i?)  is  of  rank  p  the  assumption  that  A  has 
an  eigenvalue  with  a  nonnegative  real  part  implies  that  |e^^(BsABs 
...AP"1B)|  does  not  approach  zero  as  t— — >oo  ,  But  since  A  and  eA^ 
commute  this  implies  that  f  (e^BsAe^^s , .  .AP  [  does  not 

approach  zero  and  hence  that  |e^^B[  does  not  approach  zero*  ii)  Now 
consider  the  mapping  of  X  into  Y.  The  sufficiency  follows  from  the 
fact  that  Y  depends  continuously  on  Z,  To  prove  the  necessity 
notice  that  since  Y(t)  is  given  by 


Y(t)  -  CeAtZ(0)  +  rtCeA^t“s^BX(s)ds  (2-8) 


At 

it  follows  that  the  mapping  will  not  be  continuous  unless  Ce  B 
approaches  zero  as  t— — >oo ,  It  remains  to  prove  that  this  cannot 
happen  if  A  has  a  nonnegative  eigenvalue .  To  show  that  CeAtB  does 

not  approach  zero  if  A  has  a  nonnegative  eigenvalue  we  use  the  fact 

A+  T  T  T 

that  e  n  does  not  approach  zero  and  the  fact  that  (C  }A  C 

Tid*3!!  T  TXT 

A'p  'C  )  is  of  rank  p,  Together  these  imply  that  (C  ,k  C 

Tp_i  T  T  Ah  ‘‘At 

A  F  C  )  e  B  does  not  vanish.  Using  the  fact  that  A  and  e 

commute  it  follows  that  Ce^^BjAB;  ...AP  ^B)  does  not  approach  zero. 

Now  suppose  CeAtB  — — >0  then  because  CeAtB  is  a  sum  of  exponential 

terms  it  follows  that  if  CeAtB  - — ->0  then  d/dtCeAtB  .=  CAeAtB— -*»  ~  0 . . 

Continuing  in  this  way  it  is  seen  that  the  assumption  that 

CeAtB  - — >0  leads  to  a  contradiction  of  the  statement  that 

|  CeA^’(B,AB  ,  o  ,  ,Ap-1B)  [  does  not  approach  zero,  Q.E.D. 

It  is  also  true  that  irredu cibility  implies  that  no  linear 

combination  of  the  z.(t)  vanishes  for  all  possible  choices  of  X(t) . 
1 

To  prove  this  assume  the  contrary,,  i.e.*  assume  there  exists  a 
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constant,  nonzero  p-vector  F  such  that  F  Z(t)  s  0.  From  (2.5a)  it 

T  T  T 

follows  by  successive  differentiation  that  F  B,F  AB,...F  AF  B  all 
vanish.  This,  however,  contradicts  the  assumption  that  the  matrix 
(b,ab,..bap“1b)  contains  p  linearly  independent  columns. 

In  general  the  mappings  discussed  in  lemma  1  will  be  continuous 
with  respect  to  stronger  topologies  as  well.  To  see  this  consider 
the  following  equations  which  follow  from  (2.5)  by  successive 
differentiations  (assuming  momentarily  that  XeEp  *0 . 

■CZ(t)  =  Y(t) 

€AZ(t)  *=  Y^(t)-CBX(t) 

GA2Z(t)  -  Y^2^(t)-CABX(t)-CBX^1^(t)  (2.9) 

ooeeoo*«*oooo«»eoBoeo  bo 

CA^Zft)  =  Y<'P*'1^(t)-ap"2BK(t)-CAp“5BX^1^(t)...CBX^p"2\t) 

If  CA^B  =  0  for  i  =  0,  1,  ...  k-2  then  it  follows  that  Y^k'(t) 
depends  continuously  on  Z(t)  and  X(t)  and  that  Y^  (t)  depends 
continuously  on  Z(t)  for  0  <  i  <  k.  It  follows  that  if  the  above 
condition  on  GA^  is  satisfied  then  there  exists  c  such  that  for 
finite  a  | [Y-Y1 f [k  <  c[  [X-X®  [  [^.  If  the  eigenvalues  of  A  have 
negative  real  parts  then  this  condition  on  GA^  implies  that  there 
exists  a  c  such  that  I|Y-Y’[fk<  cf [X-X! | [ . 

Equation  (2.9)  also  asserts  that  if  =  0  for  i  =0,  .1,  ... 

k-2  then  it  will  be  possible  to  solve  for  X(t)  in  terms  of  Y^  (t) 
and  Z(t)  if  and  only  if  CAn~^B  is  nonsingular.  The  if  part  is 
obvious  and  the  only  if  follows  by  noting  that  if  CAk  ^B  is 
singular  then  any  solution  for  X(t)  will  necessarily  involve  higher 
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derivatives  of  Y(t).  The  following  theorem  subsumes  most  of  the 
informal  statements  on  continuity  given  thus  far  and  gives  necessary 
and  sufficient  conditions  for  the  existence  of  one  type  of  an  inverse 
mapping  both  on  a  finite  interval  and  in  the  infinite  case. 

Theorem  Is  Consider  the  system  (2.5)  where  A,  B,  and  C  are 
constant  and  X(t),  Y(t)3  and  Z(t)  are  n,  n,  and  p-vectors  respec¬ 
tively,  Assume  that  the  system  is  irreducible.  Then  for  an 
arbitrary,  fixed,  initial  value  Z(0)  it  follows  that; 

i)  Equation  (2.5)  defines  a  continuous  mapping  of  {^(ct)  into 
Ck(a)  if  and  only  if  CAiB  =  0  for  i  =0,  1,  ...  k-2. 

ii)  If  (2.5)  maps  C  (ct)  into  Ck(a)  then  it  has  a  continuous 
inverse  which  maps  Ck(cr)  into  Cn(a)  if  and  only  if  CAk  is  non¬ 
singular, 

iii)  Equation  (2.5)  defines  a  continuous  mapping  of  C  into 

jc 

if  and  only  if  the  eigenvalues  of  A  have  negative  real  parts  and 
CAiB  =  0  for  i  =  0,  1,  ...  k-2, 

iv)  If  (2.5)  maps  Cn  into  Ck  then  it  has  a  continuous  inverse 
k  k“l 

■which  maps  into  if  and  only  if  CA  B  is  nonsingular  and 
A-B(CAk“:LB)"  ^CAk  has  p=nk  eigenvalues  with  negative  real  parts. 

Proofs  i)  From  (2.9)  it  follows  that  (2.5)  maps  C  (ct)  into 
Ck(a)  if  and  only  if  CA^B  =  0  for  i  =0,  1,  ...  k-2.  To  see  that 
this  mapping  is  continuous  let  X(t) ,  Y(t) ,  and  Z(t)  be  one  solution 
of  (2.5)  and  let  X*(t)s  Y!(t),  and  Z!(t)  be  a  second  solution  such 
that  Z(0)  =  Z»(0).  Then  as  shown  above  ||Z-Z'||  <  c| |X=X# | |  for 
some  c.  From  (2.9)  it  follows  that  [  |  Y^-Y’  1 1  ^  <  c^ |  }  Z— Z 5 1  |  ^, 
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for  i  =  0,  1,  ...  k-1  and  that  [  [yM-Y' M  1 1  <  c^||Z-Z'[|  + 

c f | | X— X ' [ |  .  The  result  now  follows  from  the  definition  of  the 

'Ck(cr)  norm. 

n  — 

ii)  If  (2.5)  maps  C^ct)  into  Ck(cr)  then  from  i)  it  follows 
that  'CA^  -  0  for  i  =  0,  1,  ...  k-2. Therefore  CAk_1BX(t)  =  Y^(t)- 
CAkZ(t).  If  T  ^  =  CAk  is  nonsingular  then  (2.5)  yields 

Z(t)  =  (A-BTCAk) Z(t)  +  BTY^k'(t)  (2.10a) 

X(t)  =  TY^(t)  -  TCAkZ(t)  (2.10b) 

From  i)  it  follows  that  the  mapping  of  Ck(a)  into  (^(ct)  defined  by 

these  equations  is  continuous.  However,  if  T  ^  is  singular  then, 

as  noted  above,  it  will  be  impossible  to  solve  for  X(t)  in  terms  of 

Y^(t)  and  Z(t)  without  introducing  higher  derivatives  of  Y(t)  and 

k 

hence  the  inverse  will  not  be  defined  for  some  elements  of  C  (cr), 

iii)  The  necessity  of  the  condition  on  CA1!}  follows  exactly  as 

in  i) ,  The  necessity  of  the  condition  on  the  eigenvalues  follows 

from  the  assumed  irreducibility  and  lemma  1.  The  sufficiency 

follows  as  in  i)  making  use  of  (2.7). 

iv)  If  (2.5)  maps  G  (a)  into  <Ck(a)  then  from  iii)  it  follows 

that  'CA^B  =  0  for  i  =0,  1,  ...  k-2.  The  necessity  of  T  ^  = 

CA^  'kB  being  nonsingular  follows  as  in  ii) .  Assuming  this  we  first 

k-1 

establish  that  the  nk  rows  of  the  matrices  C,CA,...CA  are  all 
linearly  independent.  This  will  be  done  by  showing  that  no  non¬ 
trivial  linear  combination  of  these  rows  vanish.  Let  IL  be 
arbitrary  n  x  n  matrices.  From  our  previous  remarks  it  follows 


"25= 


that  (M^C+MgGA, .  ,M.kCAk  ^)B  =  M^CA^”^B0  Since  the  right  hand  side 

of  this  equation  vanishes  if  and  only  if  does  it  follows  that 

no  linear  combinations  of  rows  involving  CAk  ^  vanishes.  Repeating 

this  argument  on  the  equation  (M^'C+MgCAj ,  „  ^CAk  ^)AB  E  Mj^CA^  ^*B 

k=2 

we  obtain  the  same  conclusion  about  CA  etc. 

Let  P  be  any  nonsingular  p  x  p  matrix  whose  first  n  rows  are 
the  rows  of  C3  whose  2nd  n  rows  are  the  rows  of  CA,,,  and  whose  k— 
n  rows  are  the  rows  of  CAk  \  Define  W(t)  as  PZ(t)  and  note  that 
from  (2,9)  it  follows  that  the  first  n  components  of  ¥  are  the  n 
components  of  Y^the  second  n  are  the  n  components  of  Y^„„»  and 
the  k™  n  components  are  the  components  of  Y^k  ^  „  With  this 
change  of  variables  the  system  (2,10)  becomes 

W(t)  -  P(C-BTCAk)P“1W(t)  +PBTY^(t)  (2,11a) 

X(t)  -  =TCAkp"1¥(t)  +  TY^(t)  (2.11b) 

It  is  convenient  to  partition  the  differential  equations 
(2.,  11a)  into  two  parts  as  follows.  Let  W1  (t)  denote  the  first  nk 
components  of  W(t)  and  ¥^(t)  the  remaining  p-nk.  Let  G^  denote  the 
first  nk  rows  of  FBT  and  let  denote  the  remaining  p-nk. 

Partition  P(A-BTCAk)P  ^  into  F,^s  F^23  F^,,  an(^  ^22  S°  ^at  (2.11a) 
becomes 

W^t)  -  Fr¥1(t)  ■>  F_  gWg(t)  +GxY(k)(t)  (2,12a) 

¥2(t)  “F^W^t)  +F22¥g(t)  +G2I(k)(t)  (2,12b) 

Since  G^  equals  the  first  nk  rows  of  PB(CAk  ^B)  A  it  follows 
from  the  definition  of  P  and  the  nature  of  CA^B  that  the  first 


“26- 


n(k-l)  rows  of  vanish  and  the  remaining  n  form  Sis  SHN  identity 
matrix  I  ,  From  the  known  relationship  between  ¥  and  W  it  follows 
that  F^g  =  0  and  F^  has  the  form 


0 

I 

n 

0  ... 

0 

0 

0 

“^n 

0 

0 

0 

0  ... 

I 

0 

0 

o  ... 

0 

(2.13) 


Because  F^g  is  zero  the  set  of  eigenvalues  of  P(A-BTCAk)P“"L  is  the 
union  of  the  sets  of  eigenvalues  of  F^  and  F^.  Since  \  =  0  is 
an  eigenvalue  of  F^  of  multiplicity  nk  it  follows  that  the  only 
nonzero  eigenvalues  of  P(A-BTCAk)P  ^  are  the  nonzero  eigenvalues  of 
F 

22* 

If  ||Y||k  is  finite  then  it  follows  that  is  finite  as  well. 
From  this  and  (2.12b)  it  follows  that  (2,11)  defines  a  continuous 

Jj 

mapping  of  into  provided  that  all  the  eigenvalues  of  F^  have 
negative  real  parts.  Since  the  eigenvalues  of  matrix  are  un¬ 
affected  by  a  similarity  transform  it  follows  that  all  the  eigen- 
values  of  F  have  negative  real  parts  if  and  only  if  A-BTCA  has 
p-nk  eigenvalues  with  negative  real  parts.  This  shows  the 
sufficiency  of  the  condition  onF^^, 

As  a  first  step  in  proving  the  necessity  of  this  condition 
on  the  eigenvalues  oi  A-BTCAk  we  will  show  that  (2.10)  is  irreduc¬ 
ible,  To  do  this  it  suffices  to  show  that  the  ranks  of 
(BT,(A-BrG4k)BT,...(A-BTG4k)p“1BT)  and  (Q, (A-BTCAk)TQ<, . . . 
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(A“BTCA^)^^  ^Q)  are  both  p  where  Q  =  (TCAk)^.  Consider  the  first 
of  these.  If  this  matrix  is  not  of  rank  n  then  there  exists  a  non- 
zero  p-vector  F  such  that  FT(BT3(A-BTCAk)BTs. . . (A~BTCAk)P“1BT)  «  0. 
But  this  implies  that  F^BT  =  0  and  hence  that  F^B  -  0.  Also  for 
such  an  F  .,F^(A-BTCAk)BT  »  0  and  therefore  F^AB  =  0.  Continuing  in 
this  way  we  see  that  F^(BSAB , , , .A?  ^B)  -  0  which  contradicts  the 


assumptions  that  the  original  system  was  irreducible. 

To  show  that  the  second  of  these  matrices  is  of  rank  p  we 
show  that  for  no  nonzero  p-vector  F  is  F^(Qs(A-BTCAk)  Q,.,0 
(A~BTCA^)T^  1Q)  =  0.  Suppose  it  is.  Then  TCAkF  *>  0  and  con¬ 
sequently  CAkF  =  0,  Since  FTCAk(A-BTCAk)F  =  0  it  follows  that 
TCAk(A-BTCAk)F  -  TCAA^F  «  0  and  in  general  that  CA^^F  «  0.  But 
if  this  were  the  case  then  A^F  would  be  a  nonzero  vector  such  that 
(AkF)^(C^JA^C^  _, , ,  0A^  ^C)  ~  0  which  contradicts  the  assumption  that 
the  system  (2,5)  was  irreducible. 

It  is  easily  verified  that  if  (2,10)  is  irreducible  then 
(2„ll)  is  as  well.  It  remains  only  to  show  that  this  irreducibility 
implies  that  for  some  choice  of  Js  regardless  of  how  small  imr 

is  required  to  bes  Z'(t)  —>  oo  ,  If  F^^  has  any  eigenvalues  with 

F  ■fc  X'fc 

positive  real  partss  then  it  follows  that  fe  22  [  >  ae  for  some 
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QG_Y^(t)  =  Q  Y(t)  +  +  Q,Y^(t).  If  Q  is  nonzero 

(L  0  JL  K  O 

then  clearly  QW^(t)  grows  without  bound  if  Y(t)  is  chosen  such  that 

Q  Y(t)  >0.  If  Q  is  zero  then  it  follows  that  QW0  can  be  ex- 
0  0 

pressed  in  terms  of  W^(t)  simply  by  integrating  the  equation 
QWg(t)  =  QF^gW^(t)  +  QGgY^k^(t).  But  this  contradicts  the  proven 
fact  that  the  inverse  system  is  irreducible  and  completes  the  proof. 

This  theorem  can  be  generalized  in  various  ways.  One  such 
generalization  concerns  the  system 

Z(t)  =  AZ(t)  +  BX(t)  (2.14a) 

Y(t)  =  CZ(t)  +  DX(t)  (2.14b) 

which  is  of  interest  in  certain  optimal  control  problems.  If  D 
is  zero  then  this  is  just  the  system  discussed  in  theorem  1.  If 
D  is  nonzero  the  following  theorem  applies. 

Theorem  2;  Consider  the  system  (2.14)  where  A,  B,  C,  and  D 
are  constant,  D  is  nonzero,  and  X(t) ,  Y(t) ,  and  Z(t)  are  n,  n,  and 
p=vectors  resoectively.  Assume  that  the  system  is  irreducible. 

Then  for  an  arbitrary,  fixed,  initial  value  Z(0)  it  follows  that 

i)  Equation  (2.14)  defines  a  continuous  mapping  of  C^ct) 
into  'C  (ct)  . 

ii)  The  system  (2.14)  has  a  continuous  inverse  which  maps 
(C n(a)  into  C  (a)  if  and  only  if  D  is  nonsingular, 

iii)  The  system  (2.14)  maps  into  if  and  only  if  the  eigen¬ 
values  of  A  have  negative  real  parts. 

iv)  If  (2.14  maps  into  C^  then  it  has  a  continuous  inverse 
which  maps  into  if  and  only  if  all  the  eigenvalues  of  A-CD  ^B 
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have  negative  real  parts. 

Proof  g  The  proof  of  i)  follows  from  the  equation  Y(t)  = 

CeA^Z(0)  +  /tCeA(t-s)BX(s)ds  +  DX(t) ,  If  D  is  nonsingular  then 

*o 

the  inverse  of  (2,14)  is 

Z(t)  =  (A-BD~1C)Z(t)  +  BD_1Y(t)  (2.15a) 

X(t)  =  -D-1BZ(t)  +  D“1Y(t)  (2.15b) 

so  that  if  D  is  nonsingular  the  inverse  exists  and  from  i)  it  is 
continuous.  If  D  is  singular  it  follows  from  the  assumption  of 
irreducibility  that  it  is  impossible  to  express  X(t)  in  terms  of 
Z(t)  and  Y(t)  without  introducing  higher  derivatives  of  Y(t).  The 
proof  of  iii)  and  iv)  proceed  in  the  same  way  as  the  corresponding 
parts  of  Theorem  1  provided  (2,10)  is  replaced  by  (2,15).  Q.E.D. 

Theorems  1  and  2  give  necessary  and  sufficient  conditions  for 

the  existence  of  a  certain  type  of  inverse,  that  is.,  either  one 

k  k 

which  maps  C^cr)  into  ^(ct)  or  else  one  which  maps  into  A 

more  general  problem  is  that  of  determining  the  circumstances  under 

which  it  is  possible  to  solve  for  X(t)  in  terms  of  Y(t)  and  any 

combination  of  its  derivatives.  That  is,  we  seek  conditions  under 

which  (2.5)  implies  a  system  of  the  form 


Z(t) 

-  A*Z(t)  ♦  B  Y(t)  +  BnY^1^(t)...B,Y^(t) 

0  1  K 

(2.16a) 

X(t) 

«  C*Z(t)  +  DQY(t)  +  D1Y^1^(t).0.DkY^k'>(t) 

(2.16b) 

Although  we  will  not  obtain  a  representation  for  the  inverse 
equations  in  the  general  case,  the  basic  question  of  existence  is 
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answered  by  the  following  theorem. 

Theorem  3 ;  Consider  the  system  (2.14)  where  A,  C.  and  D 
are  constant  (D  may  be  zero)  and  assume  X(t)>  Y(t),  and  Z(t)  are 
n^  iij  and  p-vectors  respectively.  Then  a  necessary  and  sufficient 
condition  that  it  be  possible  to  solve  this  system  for  X(t)  in  terms 
of  Z(t)j  Y(t) }  Y^(t)...  is  that  (D.GB^CAB, . .  .CA13"^)  be  of  rank 
n. 

Proof ;  The  sufficiency  follows  immediately  from  the  following 
equations  which  are  a  consequence  of  (2.14). 

Y(t)  =  CZ(t)  +  DX(t) 

Y^(t)  =  CAZ(t)  +  CBX(t)  +  DX^^(t) 

. . . . .  (2.17)' 

Y^(t)  =  CAPZ(t)+  CAP"1BX(t)  +  CA^P’"^BX^1^(t)...DX^P^(t) 

This  set  is  exhaustive  in  that  any  relationship  between  X(t)3  Z(t) 
and  Y(t)  and  its  derivatives  which  is  implied  by  (2.14)  can  be 
derived  from  (2.17)  without  using  (2.14).  If  (DyCB^CAB . . . ,CAP  ^B) 
is  of  rank  n  then  (2.17)  provides  n  equations  relating  X(t)  and  Z(t) 
and  Y(t)  and  its  derivatives.  If  it  is  not  of  rank  n  then  clearly 
it  is  impossible  to  solve  for  X(t)  from  (2.17)  and  hence  (2.14). 
Q.E.D. 

This  proof  also  shows  that  if  (D.GB jCAB , . . .CAP  ^B)  is  of  rank 
n  then  any  p-times  differentiable  output  Y  defined  on  a  finite  in¬ 
terval  can  be  produced  by  a  bounded  continuous  X,  Thus  the  re¬ 
lationship  between  X  and  Y  defined  by  such  a  system  is  linear  and 


one  to  one. 
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Notice  that  it  was  not  necessary  to  assume  that  the  system  was 
irreducible.  If  (D^CBjCAB, „ .  .CA^3  ^B)  is  of  rank  n  we  will  say  that 
the  system  is  nonsingular .  As  will  be  discussed  in  section  (2.6) 
this  terminology  is  in  harmony  with  the  usual  definition  of  non¬ 
singularity  as  applied  to  transfer  matrices. 

2.4  Nonlinear  Transformations 

We  now  examine  the  question  of  how  the  previous  results  are 
affected  by  the  presence  of  a  nonlinear  term.  Our  objective  is 
to  show  that  if  inputs  are  sufficiently  small  and  if  the  initial 
state  is  sufficiently  close  to  the  equilibrium  point  then  under 
some  circumstances  one  can  determine  the  properties  of  the  non¬ 
linear  system  by  an  examination  of  the  linear  terms. 

Let  N(a)  denote  the  set  of  elements  of  €^(1)  whose  no  ms  are 
less  than  a3  let  M(b)  denote  the  point  set  £z(t)|[z(t)f  <  bj  , 
and  let  M(a3b)  denote  the  point  set  jX(t)||X(t)|  <  aj  x  M(b) , 
Consider  the  system 

Z(t)  =  AZ(t)  +  BX(t)  +  Q.(X(t),Z(t))  (2.18a) 

Y(t)  =  CZ(t)  (2.18b) 

where  Q  and  its  partial  derivatives  with  respect  to  the  components 
of  X(t)  and  Z(t)  are  continuous  in  M(a,b)  and  vanish  at  X(t)  =  0, 
Z(t)  =0.  It  follows  from  the  classical  existence  and  uniqueness 
theorems"1'0  that  for  any  given  XeN(a)  and  any  given  Z(0)eM(b)  there 
exists  a  unique  solution  of  (2,15a)  defined  on  an  interval  o  and 
that  a  may  be  chosen  so  small  that  Z'(t)eM(b)  for  all  tea.  By 
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1 


restricting  Z(0)  still  further,  say  to  M(b/2)j  it  is  possible  to 
find  ai  interval  o'  such  that  any  XeN(a)  gives  rise  to  a  Z  which 
is  defined  on  a*  having  the  property  that  Z(t)eM(b)  for  tea1. 

Let  a  denote  an  interval  having  the  properties  of  a'  above. 
Then  (2.18a)  defines  a  mapping  of  N(a)  x  M(b/2)  into  C^(ct)  .  That 
this  mapping  is  continuous  in  Z(0)  for  fixed  X  has  long  been  known. 
To  see  that  it  is  continuous  in  X  for  fixed  Z(0)  let  Z  and  2*  be 
the  images  of  X  and  X’  respectively  and  suppose  that  2(0)  -  Z'(0)» 
It  follows  from  (2.18a)  that 


Z(t)-Z*(t)  - 


t  -As 
e 


(2.19) 

(BX(s)-BX'(s)  +  Q(X(s) ,Z(s))-Q(X'(s),Z»(s))ds 


From  the  assumptions  on  the  continuity  of  the  partial  derivatives 
of  Q(X(t),Z(t))  and  the  fact  that  the  matrix  expotential  is  bounded 
on  any  finite  interval  it  follows  that  for  tea  and  some  k  and  k' 


|Z(t)-Z'(t)|  <  k^tlZ(s)-Z»(s)|ds  +  k'  f  |X-X»  f  f  (2.20) 

Now  let  v(t)  *  J^\Z(s)-Z'  (s)  [d  s  so  that  (2.20)  becomes 
y(t)-kv(t)  <  k’||X-X!||.  By  treating  this  inequality  in  the 
standard  way  (Bellman  page  35)  it  is  easy  to  show  that 
|Z(t)-Z« (t) t  <  kn||X-X’||  for  some  kn  and  hence  to  establish  the 


continuity. 

A  second  preliminary  result  which  is  more  subtle  concerns  the 
conditions  under  which  (2,18a)  defines  a  continuous  mapping  of 
N(a’)  x  M(b')  into  C^,  The  result  which  we  establish  here  is 
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similar  to  Hahn's  statement  of  the  total  stability  theorem  ,  how¬ 
ever,  our  method  of  proof  is  quite  different. 

Lemma  2:  If  the  eigenvalues  of  A  have  negative  real  parts 
then  there  exists  a'  and  b',  both  positive,  such  that  (2.18a)  de¬ 
fines  a  mapping  of  N(a')  x  M(b')  into  which  is  continuous  in  X 

P 

for  fixed  Z(0). 

Proof  s  First  we  show  that  there  exists  a'  and  b*  such  that 

(2.18a)  maps  N(a')  x  M(b')  into  c\  Let  X  be  a  positive  number 

P 

such  that  is  greater  than  the  real  part  of  any  of  the  eigen- 
values  of  A.  Select  a  such  that  }e^[  <  ae~2Xt  for  all  t.  Note 
that  since  the  partial  derivatives  of  Q(X(t),Z(t))  with  respect  to 
X(t)  and  Z(t)  are  continuous  and  vanish  at  X(t)  *=  0,  Z(t)  =  0 
there  exists  c  and  c'  such  that  in  M(c,c'). 


Q(X(t),Z(t))  <  (fX(t)t  ♦  [Z(t)|)X/a 


(2.21) 


Using  this  and  the  above  bound  on  e  it  follows  from  (2.18a) 


that  for  some  k  and  all  tea 


[z(t)I 


<  ae"^tfZ(0)(  +  Xe_2Xt  rte2Xs[Z(s)tds  k|  |X|  | 

Jo 


(2.22) 


provided  XeN(c)  and  [Z(t)[  <  c'  for  tea.  Let  v(t)  =  /*^e2Xs|Z(s) |d 

.  lo 


Multiply  (2.22)  by  e  u  to  get 


^■(t)-Xv(t)  <  fZ(0)[a  +  k||X|[e^ 


(2.23) 


Now  multiply  this  equation  by  e*  and  integrate  by  parts  to  get 
v(t)e4Xt  <  pZ(0)[aA  ♦  kI[xUeaXtA]eK*  (2.24) 
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This  and  (2,23)  can  be  seen  to  imply 

|Z(t)|  <  2[Z(0)[ae"2Xt  +  ffX| |jk  (2.25) 

Again.,  this  equation  is  valid  for  all  t to  provided  [Z(t)|  <  c*  for 

tea.  But  (2.25)  implies  that  if  ||X[|  and  [Z(0)[  are  taken  to  'be 

sufficiently  small  then  Z(t)  can  be  made  less  than  c'  for  all  t 

and  hence  there  does  in  fact  exist  a'  and  b*  such  that  (2,18a)  maps 

N(a')  x  M(b')  into  C  . 

P 

To  show  continuity  choose  c  and  c5  such  that  for  any  X  and  X8 
belonging  to  N(c)  and  any  Z(t)  and  Z'(t)  belonging  to  M(c')  we 
have 

Q(X(t),Z(t))-Q(X'(t),Z'(t))  <  (|X(t)-X'(t)|  ♦  |Z(t)-Z'(t) f)X/d 

(2.26) 

Use  this  in  (2.19)  to  show  that  for  XeN(c)  and  some  k 

|Z(t)-Z» (t) |  <  \e"2Xt  rte2XttZ(s)tds  +  kf|X-X'[|  (2.27) 

Jo 

provided  | Z (t ) [  <  c*.  By  treating  this  in  the  same  way  as  (2.20) 
was  treated  one  can  show  that 

|fZ-Z'|l  <  !|X|[  .  Qik  (2.28) 

provided  ||x[|  and  J Z (0 ) |  are  sufficiently  small.  Q.E.D. 

In  general  the  inverse  of  the  system  (2.16)  will  not  have  a 
unique  solution  for  a  given  I.  For  this  reason  we  now  restrict 
ourselves  to  systems  of  the  form 
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(2.29a) 

(2.29b) 


Z(t)  =  AZ(t)  +  BX(t)  +  BQ.(X(t),Z(t)) 

Y(t)  =  CZ(t) 

where  Q(X(t),Z(t))  satisfies  the  same  conditions  as  before  in 
M(a,b).  By  differentiating  (2.29b)  in  the  same  way  as  we  did 
(2.5b)  previously  one  obtains 

CZ(t)  =  Y(t) 

CAZ(t)  =  Y^(t)  -  CB (X(t)  +  Q)  (2.30) 

CAP_1Z(t)  =  Y^P_1^(t)  -  GAP“2B(X(t)  +  Q) . . .CB (X(t)  +  Q) 

From  this  it  can  be  seen  that  if  it  is  possible  to  solve 
these  equations  for  Y^  in  terms  of  X  and  Z  the  solutions  will  not 
involve  derivatives  of  X.  In  general  this  is  not  true  for  the 
system  (2.18)  and  this  is  our  reason  for  restricting  our  attention 
to  the  system  (2.29). 

The  following  theorem  gives  one  set  of  conditions  which  are 
sufficient  to  insure  that  (2.29)  defines  a  continuous,  invert¬ 
ible  mapping.  It  should  be  observed  that  since  we  are  interested 
only  in  sufficient  conditions  no  ■  assumptions -on  drreducibility:'  1  ‘  t” 
need  be  made , 

Theorem  4;  Consider  the  system  (2.29)  with  the  given  assump¬ 
tions  on  Q(X(t),Z(t))  valid  in  M(a,b)..  Assume  further  that  CA^B  ■ 

0  for  i  =  0,  1,  ...  k-2  thens 

i)  There  exists  an  interval  a  such  that  for  fixed  Z(o)eM(b/2) 
this  system  defines  a  continuous  mapping  of  N(a)  into  C^(cr).  If 


-36- 


CAk  ^B  is  nonsingular  then  there  exists  c’  >  0  such  that  for  any 
given  Ye'Ck(a)  with  ||Y}[k  <  c1  there  is  a  unique  XeN(a)  satisfying 
the  system, 

ii)  If  all  the  eigenvalues  of  A  have  negative  real  parts  then 
there  exists  a!  and  b*,  both  positive,  such  that  this  system  de¬ 
fines  a  mapping  of  N(a')  x  M(b')  into  Ck  which  is  continuous  in 
X  for  fixed  Z(0).  If  in  addition  CAk  is  honsingular  and  if 
A-B(CAk_1B)“  kCAk  has  p-nk  eigenvalues  with  negative  real  parts  then 
there  exists  c1  and  d',  both  positive,  such  that  if  ||Y[|k  <  c' 
and  Z(0)eM(d')  then  corresponding  to  X  and  Z(0)  there  is  a  unique 
XeN(a)  which  satisfies  the  system. 

Proof ;  i)  Since  (2,29a)  is  a  special  case  of  (2.18'a)  it 
follows  that  the  remarks  about  existence  and  uniqueness  apply. 
Equation  (2.30)  shows  that  if  fCA1B  =  0  for  i  =  0,  1,  ...  k-2  then 
Y^(t)  depends  continuously  on  Z(t)  for  i  <  k-1  ,  and  that  Y^  (t) 
is  a  continuous  function  of  Z(t)  and  X(t) .  From  this  and  the  de¬ 
finition  of  the  Ck(<j)  norm  it  follows  that  (2.2^)  defines  a  con¬ 
tinuous  mapping  of  C  (cr)  into  Ck(o)  under  the  given  assumptions. 

Let  T  ^  =  CAk  ^B.  If  T  ^  is  nonsingular  then  it  follows  from 
(2.30)  and  the  assumptions  on  CA^B  that 

TCAkZ(t)  -TY^(t)  -  X(t)  -  Q(X(t),Z(t))  (2.31) 

17 

Applying  the  implicit  function  theorem  to  this  equation  gives 


X(t)  =  -TGAkZ(t)  +  TY^(t)  +  Q,'  (Z(t),Y^(t)) 


(2.32) 
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where  Q'  (Z(t)  (t) )  enjoys  the  same  properties  as  Q(X(t),Z(t)) 

in  some  set  M(a'jb').  Using  this  and  (2,29)  it  is  seen  that  the 
inverse  for  (2.29)  is 

Z(t)  =  (A-BTCAk)Z(t)  +  STCA^Y^  (t)  +  BQ '  (Y^  (t)  ,Z(t) ) 

(2.33a) 

X(t)  -  -TGAkZ(t)  +  TY^(t)  +  Q'  (Z(t),Y^(t)  (2.33b) 

From  the  remarks  made  about  (2.18a)  it  follows  that  (2.33a) 
defines  a  continuous  mapping  for  small  X,  2(0) }  and  a  and  hence 
the  mapping  of  Ck(cr)  into  (^(ct)  defined  by  2,33  is  continuous, 

ii)  In  view  of  lemma  2  it  remains  only  to  show  that  there 
exists  c*  and  d'  having  the  desired  properties.  Consider  (2. 33a). 
Define  P  from  (2.30)  exactly  as  P  was  defined  in  the  proof  of 
Theorem  13  part  iv.  Let  ¥(t)  =  PZ'(t) .  Partitioning  ¥  as  in  that 
proof  we  see  that  ¥^(t)  is  small  if  [|Y|[k  is  and  that  ¥^(t)  satis¬ 
fies  the  equation 

Wg(t)  =F22W2(t)  *F21W1(t)  +  G2Q'(P¥(t),Y(k)(t))  (2.34) 

If  A-B(CAk"1B)“1  CAk  has  p~nk  eigenvalues  with  negative  real  parts 
then  because  F  ^  =  0  all  the  eigenvalues  of  F ^  have  negative  real 
parts.  The  desired  result  now  follows  from  lemma  2.  Q.E.D, 

2.5  Asymptotic  Controllability 

Adequate  control  is  often  possible  even  if  a  system  fails  to 
have  a  continuous  inverse.  The  reason  being  that  for  many  processes 
the  exact  form  of  the.  response  is  unimportant  as  long  as  the  proper 
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steady-state  value  is  eventually  achieved.  For  this  reason  the 
problem  of  determining  the  circumstances  under  which  steady-state 
control  is  possible  is  of  interest.  We  begin  by  showing  how  this 
problem  can  be  stated  in  terms  of  a  mapping . 

In  order  to  state  this  problem  in  terms  of  a  mapping  it  is 
necessary  to  use  the  notion  of  an  equivalence  class.  Let  C 

—EL 

denote  the  subset  of  C  for  which  lim  Y(t)  exists.  The  norm  for 

n  t — >oo 

C  is  taken  to  be  the  usual  C  norm.  Partition  C  into  equivalence 
~n  n  -n  J 

classes  according  to  the  relationship  Y^X  if  lim  [Y(t)~X(t)|  ■=>  0 

t — ’>00 

and  let  underlined  capitals  such  as  Y  and  X  denote  particular 

equivalence  classes.  The  set  of  all  equivalence  classes  will  be 

denoted  by  R  and  the  norm  for  R  is  defined  as  |  [Y[  |  =  lim 

t — >oo 

| Y(t) I .  We  will  say  that  a  system  is  asymptotically  controllable 
if  it  defines  a  continuous  mapping  of  onto  ®  .  From  a  practical 
point  of  view  this  says  that  if  a  system  is  asymptotically  controll¬ 
able  then  its  outputs  may  be  made  to  approach  any  desired  steady- 
state  value  by  using  an  input  which  tends  toward  a  constant. 

Moreover s  it  implies  that  small  changes  in  the  input  result  in  small 
changes  in  the  asymptotic  value  of  the  output.  The  conditions  under 
which  the  system  (2.14)  is  asymptotically  controllable  are  given  by 
the  following  theorem. 

Theorem  5;  Consider  the  system  (2,14)  with  As  B,  C?  and  D 
constant.  Let  X(t)  and  Y(t)  be  n-vectors  and  assume  the  system 
is  irreducible.  Then  these  equations  define  an  asymptotically 
controllable  system  if  and  only  if  the  eigenvalues  of  A  have  negative 
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real  parts  and  t3A  T3+D  is  nonsingu'lar. 

Proof ;  The  necessity  of  the  condition  on  the  eigenvalues  of 

A  follows  from  lemma  1  and  the  fact  that  the  mapping  is  required 

to  be  continuous.  Assuming  that  the  eigenvalues  of  A  have  negative 

real  parts  it  follows  that  if  Xe'C  then 

n 


0  =  AZ  +  BX  (2„35a) 

Y  =  CZ  +  DX  (2.35b) 


As  explained  above ,  the  underlines  indicate  asymptotic  values. 

Since  A  has  no  zero  eigenvalues  it  is  nonsingular  and  hence  I  = 

(CA  +  D)X.  Therefore,  the  nonsingularity  of  the  matrix 

CA  +  D  is  necessary  and  sufficient.  Q.E.D. 

It  is  of  some  interest  to  note  that  the  nonsingularity  of 

(CA  ^B  +  D)  implies  that  the  matrix  (D,CB,CAB, . .  .CAP"'LB)  appearing 

in  theorem  3  is  of  rank  n  but  that  this  implication  may  not  be 

reversed.  The  problem  of  establishing  the  conditions  under  which 

(2.18)  is  asymptotically  controllable  in  a  neighborhood  of  the 

origin  immediately  suggests  itself.  In  view  of  lemma  2  such  a 

result  would  seem  reasonable,  however,  a  necessary  step  is  to  show 

that  under  suitable  restrictions  of  Z'(o)  and  |  |X[  |  (2.18)  maps 

C  into  'C  .  That  is,  one  must  show  that  Y(t)  — >  0.  If  additional 
— n  — n 

restrictions  are  placed  on  X,  e.g.,  if  it  is  required  that 
^"CD|X(t)|dt  be  small,  then  this  can  be  shown,  but  no  proof  in¬ 
dicating  that  Y(t)—>0  for  all  XeC  has  been  found. 
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2,6  Transfer  Matrix  Representation 

In  view  of  the  large  body  of  control  systems  theory  which  is 
based  on  the  use  of  transfer  functions  and  transfer  matrices  it 
is  appropriate  to  interpret  our  results  on  linear  systems  in  these 
terms.  Let  X(s)  and  Y(s)  denote  the  Laplace  transforms  of  X(t)  and 
Y(t)  respectively.  Then  if  X(t)  and  Y(t)  are  related  by  (2.14)  we 
have 

Y(s)  =  (C(ls-A)-1B  +  D)  X(s)  +  C(Is-A)_1Z(0)  (2.36) 

The  n  x  n  matrix  C(ls-A)  +  D  is  usually  called  the  transfer 
matrix.  It  is  an  important  property  of  the  system  and  many  of  our 
results  which  relate  to  linear  systems  can  be  interpreted  in  terms 
of  it . 

Let  P(s)  =  C(ls-A)-'I'B  +  D.  Our  objective  is  to  discuss  the 
various  assumptions  which  have  been  made  about  A,  B,  C,  and  D  in 
terms  of  P(s),  First  notice  that  for  all  values  of  s  such  that 
(Is~A)  is  nonsingular  (Is-A)  is  bounded  and  thus  the  poles  of 
P(s)  occur  where  det(ls-A)  »  0,  Since  the  values  of  s  which  satisfy 
this  equation  are  j\ist  the  eigenvalues  of  A  it  follows  that  if  A 
has  eigenvalues  with  negative  real  parts  then  the  poles  of  P(s)  lie 
in  the  left  half -plane.  Notice  that  detA  ~  0  then  P(s)  has  a  pole 
at  the  origin. 

The  number  of  times  that  an  output  is  differentiable s  i.e,  the 

largest  number  k  such  that  YeC^  for  all  XeC^j  can  also  be  easily 

•  * 

related  to  P(s).  Suppose  X(t)  is  such  that  X(0*)~  X(0)  -  J  ^  0, 
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based  on  the  use  of  transfer  functions  and  transfer  matrices  it 
is  appropriate  to  interpret  our  results  on  linear  systems  in  these 
terms.  Let  X(s)  and  Y(s)  denote  the  Laplace  transforms  of  X(t)  and 
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have 
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The  n  x  n  matrix  C(Is-A)  +  D  is  usually  called  the  transfer 
matrix.  It  is  an  important  property  of  the  system  and  many  of  our 
results  which  relate  to  linear  systems  can  be  interpreted  in  terms 
of  it. 

Let  P(s)  =  C(ls-A)_1B  +  D,  Our  objective  is  to  discuss  the 
various  assumptions  which  have  been  made  about  ks  B ,  C,  and  D  in 
terms  of  P(s).  First  notice  that  for  all  values  of  s  such  that 
(Is-A)  is  nonsingular  (Is-A)  is  bounded  and  thus  the  poles  of 
P(s)  occur  where  det(Is-A)  ~  0,  Since  the  values  of  s  which  satisfy 
this  equation  are  just  the  eigenvalues  of  A  it  follows  that  if  A 
has  eigenvalues  with  negative  real  parts  then  the  poles  of  P(s)  lie 
in  the  left  half -plane.  Notice  that  detA  ~  0  then  P(s)  has  a  pole 
at  the  origin. 

The  number  of  times  that  an  output  is  differentiable ,  i.e.  the 
largest  number  k  such  that  YeC^  for  all  XeC^  can  also  be  easily 
related  to  P(s).  Suppose  X(t)  is  such  that  X( 0^)-  X(0)  J  f  0, 
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Then  by  applying  the  initial  value  theorem  it  follows  that 

Y(0+)  -  Y(0)  =  lim  P(s)J„  Therefore  if  P(co)  -  0  then  the 
s— joo  k 

mapping  is  into  C^.  In  general  the  mapping  will  be  into  if 

and  only  if 

lim  s^Cs)  ~  0  for  i  *  05  15  ...  k-1  (2.37) 

5—>oo 

Since  det  P(s)  is  a  rational  function  it  either  vanishes 
identically  or  else  is  nonzero  at  all  but  a  finite  number  of 
points  in  the  s-plane.  In  the  later  case  it  is  possible  to  solve 
the  equation  Y(s)  =  P(s)X(s)  for  X(s)  and  P(s)  is  said  to  be  non¬ 
singular.  The  condition  that  (D  pCB^CAB ,,  „ .  ,CA^  ^B)  be  of  rank  n 
implies  that  P(s)  is  nonsingular  and  therefore  justifies  our  termin¬ 
ology. 

If  the  system  is  nonsingular  then  X(s)  =  P  ^(s)Y(s).  Since 
P  (s)  contains  poles  where  det  P(s)  vanishes  it  follows  that  X(s) 
will  contain  right  half -plane  poles  if  det  P(s)  contains  right 
half -plane  zeros.  Moreover ,  the  zeros  of  det  P(s)  coincide  with 
the  eigenvalues  of  the  inverse  equation.  Much  of  the  utility 
of  the  inverse  equation  stems  from  this  fact.  Systems  for  which 
det  P(s)  has  no  right  half -plane  zeros  are  sometimes  called 
minimum  phase  systems.  Thus  if  the  inverse  equation  is  stable  the 
system  is  minimum  phase. 

The  final  value  theorem  states  that  if  P(s)  has  no  poles  in 
the  right  half -plane  or  on  the  jco-axis  then  Y  -  P(0)X_,  but  P(0) 
is  simply  CA  +■  D  so  that  the  condition  for  asymptotic  con- 
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t reliability  can  be  expressed  in  terms  of  P(s)  by  saying  P(0) 
should  be  non singular. 

A  detailed  explanation  of  the  implications  of  irredu cibility 

18  £4 

has  been  given  by  Gilbert  and  Kalman  .  Roughly  speaking 

irredu cibility  implies  that  there  are  no  redundant  variables  in 

the  state  vector.  If  X  and  Y  are  scalars  then  irredu cibility 

insures  that  the  numerator  and  denominator  of  P(s)  contain  no 

common  factors  and  that  the  degree  of  the  denominator  is  the  same 

as  the  dimension  of  Z.  Also  in  the  scalar  case  it  may  be  shown  that 

the  degree  of  the  numerator  of  P(s)  is  p-l-k  where  p  is  the  dimension 

of  Z  and  lc  is  the  least  nonnegative  integer  i  for  which  CA^B  is 

nonzero.  By  applying  the  results  of  theorem  1  we  see  that  if  the 

degree  of  the  denominator  of  the  transfer  function  exceeds  that  of 

the  numerator  by  k  and  if  the  denominator  contains  left  half -plane 

k 

zeros  only  then  the  system  maps  into  The  condition  for 

asymptotic  controllability  is  that  the  numerator  polynomial  should 
not  have  a  zero  at  the  origin. 

2.7  Conclusions 

Our  objective  has  been  to  develop  a  means  for  characterizing 
nonhomogeneous  differential  equations  which  places  in  evidence 
certain  properties  which  are  important  in  automatic  control 
applications.  It  has  been  shown  that  the  systems  uider  consideration 
can  be  viewed  as  a  mapping  whose  domain  is  a  set  of  functions 
and  whose  range  can  be  either  a  set  of  functions  or  a  set  of 
equivalence  classes.  Moreover,  it  is  possible  to  introduce  norms 
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for  the  input  and  output  spaces  in  such  a  way  as  to  make  it  possible 

i 

to  express  many  of  the  standard  problems  in  control  in  terms  of  the 
continuity  and  invertability  of  these  mappings.  The  idea  of  com¬ 
puting  an  inverse  equation  is  introduced  and  the  role  of  this 
equation  in  determining  the  character  of  the  mapping  was  discussed. 

These  ideas  have  been  applied  to  linear,  time -invariant  systems 
and  to  a  certain  class  of  nonlinear  systems.  The  principle  results 
are  the  following: 

1)  If  the  system  is  linear,  time -invariant,  and  irreducible, 
then  continuity  and  asymptotic  stability  are  equivalent. 

2)  If  the  system  is  described  by  (2.5)  and  CA^B  =  0  for 

i  =  0,  1,  ...  k-2  with  CAk  being  of  full  rank  then  explicit  for- 
hlas  can  be  given  for  the  inverse  equation  and  Theorem  1  provides 
a  complete  description  of  the  mapping.  The  most  general  single - 
input  single-output  system  is  of  this  type. 

3)  The  mapping  defined  by  (2.14)  has  an  inverse  if  and  only 
if  the  matrix  (D,CB,CAB,  „ .  .CA^^B)  is  of  rank  n.  The  term  non¬ 
singular  was  used  to  describe  such  systems. 

4)  If  certain  continuity  requirements  are  satisfied  the  effect 
of  a  nonlinear  term  of  the  form  BQ(X(t)  ,Z'(t) )  can  be  ignored  if 

one  is  only  interested  in  local  behavior. 

5)  The  concept  of  asymptotic  controllability  was  introduced 
and  it  was  shown  that  the  system  (2.14)  is  asymptotically  con¬ 
trollable  if  and  only  if  the  system  is  asymptotically  stable 

and  CA  +  D  is  of  full  rank. 
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Important  problems  which  remain  include  that  of  obtaining 
a  representation  of  the  inverse  equation  which  would  apply  to  the 
general  case  and  proving  that  the  conditions  for  asymptotic  con¬ 
trollability  are  not  affected  by  a  nonlinear  term  of  the  form  dis¬ 


cussed  in  theorem  4„ 


CHAPTER  III 


THE  EFFECT  OF  FEEDBACK 

3.1  Types  of  Feedback 

If  the  relationships  between  a  systems  inputs  and  outputs  are 
unsatisfactory  one  often  seeks  to  improve  them  by  the  use  of  feed¬ 
back.  For  example.,  if  the  system  equations  are 

Z(t)  «F(Z(t),X(t))  (3.1a) 

Y(t)  =  G(Z(t),X(t))  (3.1b) 

and  if  the  relationship  between  X  and  I  that  these  equations  impose 
is  unsuitable  then  it  may  be  possible  to  remedy  the  situation  by 
forcing  X(t)  to  depend  on  I.  Ordinarily  this  has  the  effect  of  re¬ 
placing  X(t)  in  (3.1a)  by  X(t)  +  X'(t)  where  X'(t)  is  dependent  on 
I.  Much  of  the  work  in  automatic  control  theory  is  concerned  with 
the  evaluation  of  the  effects  of  such  substitution. 

This  chapter  is  concerned  with  the  question  of  how  feedback 
affects  the  properties  which  were  discussed  in  the  previous  chapter 
and  is  also  concerned  with  its  effect  on  stability.  The  objective 
is  to  define  some  of  its  possibilities  and  limitations.  The 
question  of  how  certain  types  of  feedback  affect  the  properties  of 
irredu cibilityj  stability,,  and  stability  of  the  inverse  equation 
will  be  examined.  Our  principle  results  relate  to  linear }  time- 
invariant  systems  with  linear  and  nonlinear  feedback. 

The  dependence  of  X'(t)  on  Y  can  take  many  forms.  If  X^t) 
is  a  function  of  Y(t)  only.,  i.e.  X»(t)  =  G(Y(t))  then  one  speaks 
of  gain  feedback.  A  more  general  case  is  where  X'(t)  = 
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G(Y(t),Y^(t),...Y^(t)).  This  is  usually  called  derivative 
feedback  and  includes  gain  feedback  as  a  special  case.  A  third 
possibility  is  that  the  relationship  between  X' (t)  and  Y  is  itself 
a  dynamic  one  being  defined  by  a  system  of  the  form 

Z'(t)  -  F'(Z'(t),Y(t))  (3.2a) 

X'(t)  =  G'(Z«(t),Y(t))  (3.2b) 

This  is  a  generalization  of  what  is  usually  called  integral  feed¬ 
back. 

Notice  that  derivative  feedback  does  not  change  the  number  of 
initial  conditions  needed  to  describe  the  motion^  that  is,  it  does 
not  change  the  dimension  of  the  differential  equation  of  the  system. 
This  is  not  true  for  integral  feedback,  however,  as  can  be  seen  by 
eliminating  X'(t)  from  (3.1)  and  (3.2),  If  this  is  done  the  re¬ 
sulting  system  is 

Z(t)  =F(Z(t),X(t)  +  G'(Z'(t),G(Z(t),X(t)))  (3.3a) 

Z'(t)  -F'(Z»(t),G(Z(t),X(t}))  (3.3b) 

Y(t)  =  G(Z(t) ,X(t))  (3.3c) 

The  system  (3.3)  clearly  has  a  higher  dimensional  state  vector  than 
the  original  system.  The  equations  (3.3a)  and  (3.3b)  which  deter¬ 
mine  the  stability  of  the  closed  loop  system  constitute  what 
Lefschetz  terms  "differential  equations  on  product  spaces".  It 
is  interesting  to  note  the  immediate  application  which  the  theory 
of  such  equations  has  in  automatic  control. 
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In  studying  the  effects  and  limitations  of  feedback  as  a  device 
for  altering  a  given  system  we  seek  to  identify  those  properties  of 
the  system  which  are  not  affected  by  a  class  of  feedbacks.  In 
general  stability  is  not  such  a  property  for  it  is  easy  to  give 
examples  of  both  unstable  systems  which  can  be  stabilized  by  feed¬ 
back  and  stable  systems  which  have  been  unstabilized  by  feedback. 
Other  properties  such  as  invertibility,  irreducibility?  and  stability 
of  the  inverse  equations  do,  however,  remain  unchanged  by  the 
addition  of  wide  class  of  feedbacks.  Under  some  circumstances  even 
stability  remains  unaffected  by  the  addition  of  feedback  as  the 
recent  work  on  the  absolute  stability  problem  shows. 

The  main  results  given  here  relate  to  linear  systems  having 
linear  and  nonlinear  derivative  feedback.  The  corresponding  pro¬ 
blems  involving  integral  feedback  seem  to  be  more  difficult  and  are 
not  treated  here.  It  is  important  to  note  that  in  our  definition  of 
feedback  we  have  not  allowed  the  feedback  to  be  a  function  of  the 
entire  state  vector  Z,  but  instead  it  is  assumed  that  the  feedback 
relates  to  the  actual  output  of  the  system  Y.  This  assumption  seems 
to  be  the  natural  one  from  a  physical  standpoint  although  other 
authors  have  taken  a  different  point  of  view. 

3,2  Feedback  Invariants 

As  mentioned  above  the  results  to  be  discussed  relate  to  linear, 
time -invariant  systems  with  linear  and  nonlinear  feedback.  Consider 
the  system 


Z(t)  =  AZ(t)  +  BX(t) 


(3.4a) 
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Y(t)  -  CZ(t)  (3.4b) 

to  ■which  will  be  added  either  linear  feedback  of  the  type 

X'(t)  =  GQY(t)  +  GjI^t)  +  ...  GkI(k)(t)  (3.5) 

or  else  nonlinear  feedback  of  the  type 

X'(t)  -  G(Y(t),Y(l)(t),  ...  Y(k)(t))  (3.6) 


In  either  case  we  will  say  that  the  feedback  is  of  order  k  if  the 
'k—  derivative  of  Y(t)  is  the  highest  one  appearing  in  the  ex¬ 
pression  for  X'(t). 

In  view  of  the  role  played  by  irredu cibility  in  the  previous 
chapter  the  following  theorem  showing  that  linear  derivative  feed¬ 
back  does  not  affect  the  irredu cibility  is  of  interest. 

Theorem  6;  Assume  that  the  system  (3.4)  is  irreducible  and 
k 

maps  C  into  C  „  Then  if  linear  derivative  feedback  of  order  k-1 
^  n  n 

or  less  is  applied  the  resulting  system  is  still  irreducible. 

k 

Proof :  Since  the  original  system  is  assumed  to  map  into 
it  follows  from  theorem  1  that  CA^B  is  zero  for  i  =0,  1,  ...  k-2, 
and  hence  that  Y  ^(t)  =  CA^Z(t)  for  the  same  values  of  i.  The 
differential  equations  after  feedback  can  be  expressed  in  terms  of 
Y  as 


Z(t)  =  AZ(t)  ♦  BX(t)  +  B(GQY(t)  +G1Y(1^(t)  +  ...  G^Y^"1^  (t) ) 

(3.7) 

Expressing  Y^  in  terms  of  Z(t)  enables  one  to  write  the  equations 


of  the  feedback  system  as 
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Z(t)  -  FZ(t)  +  BX(t)  (3.8a) 

X(t)  -  CZ(t)  (3.8b) 

where 

F  A  +  BG  C  ♦  BG  CA  +  . . .  BG,  ^CA^1  (3.9) 

In  showing  the  irreducibility  of  (3.8)  we  will  use  the  fact 
that  'CF1  =  CA1  for  i  =  0,  1,  ...  k-1.  This  follows  immediately 
from  the  form  of  F  and  the  fact  that  CA1!?  =  0  for  i  =  0S  1,  ... 
k-2.  To  show  that  (3.8)  is  irreducible  it  is  necessary  and 
sufficient  to  show  that  (BJFBS...FP"1B)  and.  (T5T,FT’cF,  ...FTp“1cr) 
are  both  of  rank  p  where  p  is  the  dimension  of  Z(t).  Suppose 
(B,FBs..„Fp“1B)  is  not  of  rank  p.  Then  there  exists  a  nonzero f 
p-dimensional,  row-vector  H  such  that  H(B ,FB , . . ,Fp-1B)  =  0.  This 
implies  HF"^  =  0  for  i  =  0,  1,  ...  p-1.  Using  the  fact  that  HB  =  0 
it  follows  that  HAB  =  HFB  and  hence  vanishes.  Reasoning  inductively, 
it  follows  that  if  HF"^  =  0  for  i  =  0,  1,  ...  j-1,  then  HA^B  * 

HF^B.  What  this  shows  is  that  if  H(B,FB,„..FP  ^B)  vanishes  then 
H(BSAB,...AP~1B)  vanishes.  This  last  statement  establishes  a  con¬ 
tradiction  because  H  was  assumed  to  be  nonzero  and  the  assumed 
irreducibility  of  (3.4)  implies  that  (B,AB,...AP  ^B)  is  of  rank  p. 

To  show  that  (CT,FTCT,  „ .  .F^^)  is  of  rank  p  again  assume  the 
contrary  and  let  H  be  a  nonzero  p-dimensional  row- vector  such  that 
H(C^  SF^C^S .  „  .F^P  "4i)  =  0.  As  noted  above  CF^"  =  CA^  for  i  0,  1,... 
k-1  so  the  assumption  of  H  implies  that  H(C  ,A  C  ,  ...A  "13 )  =0. 

But  from  the  form  of  F  it  follows  that  if  CA^  V  -  0  then  FH^  =  AH^ 
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and  hence  GF^%T  =  CAk'  ■■VkT  *  CA^l1 .  Therefore  the  vanishing  of 
V  rn  T  T  T  Tic  Tv 

CF  H  implies  H(C  ,A  C  ,„,A  C  )  *>  0.  This  argument  can  now  be 
repeated  p-l-k  times  to  show  that  H(C^jA^C^} , . .A^  ^C^)  *=  0.  The 
last  statement  contradicts  the  assumption  that  (3.4)  was  irreducible 
and  completes  the  proof. 

The  assumption  on  the  order  of  the  feedback  was  used  in  an 
essential  way  and  it  is  possible  to  construct  examples  which  show 
that  theorem  6  is  not  true  if  the  derivative  feedback  is  allowed  to 
be  of  order  k.  A  second  result  relating  to  this  type  of  feedback 
is  the  following  theorem  which  shows  that  with  the  above  assumptions 
feedback  cannot  make  a  nonsingular  system  singular. 

Theorem  7;  Assume  that  the  system  (3.4)  is  irreducible  and 

k 

nonsingular  and  that  it  maps  into  Cq.  Then  if  linear  derivative 
feedback  of  order  k-1  or  less  is  applied  the  resulting  system  is  also 
nonsingular  and,  if  the  eigenvalues  of  F  have  negative  real  parts 
then  it  defines  a  continuous  mapping  of  C^  into  C^. 

Proof;  To  show  that  the  non singularity  of  (3.4)  implies  the 
non singularity  of  (3.8)  it  is  necessary  to  show  that  if  (CB,CAB,.., 
CA^3  ^B)  if  the  rank  n  then  so  is  (CB,CFB, . .  .-CF^’^B) .  Assume  that 
the  matrix  (CB, CFB, . „ .CF^3  ^B)  is  not  of  rank  n.  Then  there  exists 
a  nonzero  n-vector  H  such  that  H(CB,CFB,...CF^  ^B)  =  0.  Since  our 
assumptions  insure  that  CF1  =  CA1  for  i  =0,  1,  ...  k-1  it  follows 
that  H(CB<,CAB5„.  „CAk'’'LB)  =  0.  Consider  HCF^B  which  may  be  written 
as  HCAk-1FB  -  HCA^B.  This  shows  H(CB,CAB, .  ..CA1^)  =  0.  This 
argument  can  now  be  repeated  p-l-k  times  to  show  that  H(CB,CAB  . 


-51- 


CA.P'  "  d)  -  0.  This  is  a  contradiction,  however,  because  H  was 
assumed  to  be  nonzero  and  (CB,CAB, . . „CAP  ^B)  is  of  rank  n. 

Showing  that  if  the  eigenvalues  of  F  have  negative  real  parts 
then  the  system  with  feedback  defines  a  continuous  mapping  of  € 
into  is  equivalent  by  theorem  1  to  showing  that  CF^B  =  0  for 
i  =  0,  1,  ...  k-2.  But  as  noted  above  CF1  =  CA1  for  these  values  of 
if  and  therefore  since  GA^B  =  0  for  i  =0,  1,  , „ „  k-2  the  desired 
result  is  obtained.  Q.E.D. 

In  addition  to  irreducibility  and  nonsingularity,  asymptotic 
controllability  is  also  unaltered  by  the  type  of  linear  derivative 
feedback  under  discussion,  provided  the  resulting  system  is  stable. 
The  exact  result  is  this. 

Theorem  8:  Assume  that  the  system  (3.4)  is  irreducible  and 
that  it  defines  a  continuous  mapping  of  into  Cn„  Suppose  that 
linear  derivative  feedback  of  order  k-1  or  less  is  applied  and  that 
the  resulting  system  also  defines  a  continuous  mapping  of  C  into 
C  .  Then  the  system  with  feedback  is  asymptotically  controllable 
if  and  only  if  the  original  system  (3.4)  was. 

Proof ;  Assume  that  the  equations  after  feedback  are  given  by 
(3.7),  (3.8),  and  (3.9).  Because  the  original  system  was  assumed 
to  be  irreducible  it  follows  from  theorem  6  that  the  system  with 
feedback  is  also  irreducible.  Since  both  systems  are  assumed  to 
define  continuous  mappings  it  follows  from  lemma  1  that  the  differ¬ 
ential  equations  (3.4a),  (3.7)  and  (3.8a)  are  all  asymptotically 
stable.  Let  X.  Y  and  Z  denote  steady-state  values.  Then  (3.7)  im- 
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plies  that  AZ  ♦  BX  +  BG^Y  =  0.  The  asymptotic  stability  of  (3.4a) 
implies  that  A  is  nonsingular.  Using  this  and  (3.8b)  we  see  that 
for  the  system  (3.7)  or  (3.8)  we  have 

(I  CA_1BGo)  Y  +  CA-1B  X  =  0  (3.10) 


From  theorem  5  it  follows  that  the  system  (3.4)  is  asymptotic¬ 
ally  controllable  if  and  only  if  CA  is  nonsingular.  Equation 
(3.10)  shows  that  under  the  assumptions  made  here,  this  condition 
is  necessary  for  (3.8)  to  be  asymptotically  controllable  also.  To 
show  that  it  is  sufficient  it  is  necessary  to  show  that  I  ♦  CA  ^BGc 
is  nonsingular. 

As  noted  above ,  A  is  nonsingular.  Moreover,  since  (3.8a)  is 
asymptotically  stable  F  is  nonsingular  also.  From  the  definition 
of  F  and  the  fact  that  CA2®  =  0  for  i  *  0,  1,  ...  k-2  it  follows 


Ch  =  B(I  ♦  G  CA_1B) 


(3.11) 


Notice  that  since  CA  'iB  is  of  rank  n  it  follows  that  B  is  of  rank  n 
Since  F  and  A”^  are  nonsingular  and  of  dimension  n  or  larger  it 
follows  that  FA~^B  is  of  rank  n.  But  from  (3.1l)  it  follows  that 
the  n  x  n  matrix  I  +  G  CA"’^B  is  nonsingular.  If  this  matrix  is  pre 
multiplied  by  CA  and  postmultiplied  by  (CA  ^B)  ^  the  matrix 
I  +  CA  ^BG  is  obtained.  Since  similarity  transformations  do  not 
alter  rank  it  follows  that  I  +  CA  ^B  is  of  full  rank  and  that  the 
system  (3.8)  is  asymptotically  controllable.  Q.E.D. 


It  is  clear  that  the  type  of  feedback  being  discussed  here  can 
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not  make  a  reducible  system  irreducible  or  a  singular  system  non¬ 
singular,  In  view  of  this,,  theorems  6>  73  and  8  show  that  irreduci- 
bility,  nonsingularity3  and  asymptotic  controllability  are  not 
affected  by  a  rather  wide  class  of  linear  feedbacks.  The  purpose  of 
the  next  theorem  is  to  show  that  the  homogeneous  solutions  of  the 
inverse  equation  enjoy  a  similar  property  even  if  nonlinear  deriva¬ 
tive  feedback  is  applied. 

Theorem  9;  Assume  that  the  system  (3.4)  is  nonsingular  and 
that  it  maps  into  C  ,  Consider  the  system 

Z(t)  «  AZ(t)  ♦  BX(t)  +  BG(CZ(t),CAZ(t),  ...  CAk“1Z(t)  (3.12a) 
Y(t)  =  CZ(t)  (3.12b) 

which  is  obtained  from  (3.4)  by  applying  nonlinear  derivative  feed¬ 
back  or  order  k-1.  Then  any  solutions  Z  of  (3.4a)  for  which  CZ(t) 
vanishes  identically  is  also  a  solution  of  (3.12a). 

Proofs  If  CZ(t)  vanishes  identically  then  Y(t)  does  also 
and  hence  CA1Z(t)  is  identically  zero  for  i  =  0,  1,  ...  k-1.  Using 
this  (3„12a)  assumes  the  form  of  (3.4a)  and  therefore  has  the  same 
solutions.  Q.E.D. 

What  this  theorem  shows  is  that  feedback  does  not  alter  the 
homogeneous  solutions  of  the  inverse  equation.  Therefore  if  the 
differential  equation  of  the  inverse  system  is  unstable  (i.e.  if 
the  inverse  mapping  is  not  continuous)  then  derivative  feedback 
of  the  type  being  discussed  will  not  remedy  the  situation.  This 
can  be  regarded  as  a  generation  of  the  statement  that  a  nonminimum 
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phase  system  cannot  be  made  minimum  phase  by  the  application  of 
derivative  feedback. 

The  preceding  group  of  four  theorems  define  some  of  the  pro¬ 
perties  of  a  system  which  cannot  be  affected  by  feedback. 

The  results  which  were  obtained  relate  to  feedback  of  arbitrary 
magnitude  and  sign.  In  the  following  section  we  will  show  that  by 
restricting  the  sign  some  results  of  this  type  relating  the  stabil¬ 
ity  of  the  closed-loop  to  the  stability  of  the  open-loop  system  can 
be  obtained. 

3.3  Feedback  and  Stability 

The  problem  of  predicting  the  stability  properties  of  the 
closed-loop  system  in  terms  of  the  open-loop  behavior  is  a  basic 
one.  The  standard  tools  for  designing  linear  control  systems,  such 
as  the  root-locus  and  the  Nyquist  plot,  are  useful  primarily  be¬ 
cause  they  answer  this  question.  Our  objective  here  is  to 'discuss 
the  effects  of  linear  and  nonlinear  feedback  on  stability  and  to 
show  how  the  inverse  equation  helps  to  predict  the  effects  of  feed¬ 
back.  We  will  discuss  only  the  case  where  X  and  Y  are  scalars. 

Consider  the  single -input ,  single -output  system 

Z(t)  =  AZ(t)  +  Bx(t)  (3.13a) 

y(t)  =  CZ(t)  (3.13b) 

and  assume  Z(t)  is  p~dimensional.  Let  K  denote  the  set  of  scalar 
valued  functions  of  a  scalar  argument  which  are  continuous  and  have 
the  additional  property  that  xf (x)  >  0  for  all  nonzero  x  and  all 
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feK,  The  system  (3.13)  is  said  to  be  absolutely  stable  if  the 
differential  equation  (3,13a)  is  asymptotically  stable  in  the  large 
whenever  X(t)  is  replaced  by  -  f(y(t))  =  -f(CZ(t))and  feK.  That  is, 
the  system  (3.13)  is  absolutely  stable  if  the  solution  Z(t)  s  0  of 
the  e  quation 

Z(t)  =  AZ(t)  -  Bf (CZ(t))  (3.14) 

is  asymptotically  stable  for  all  Z^(0)  and  all  feK. 

45 

This  problem  has  been  widely  studied  and  recently  Popov  and 
25 

Kalman  have  made  important  contributions.  Our  principle  objective 
here  is  to  show  that  a  necessary  condition  for  absolute  stability 
is  that  either  CB  or  GAB  be  nonzero  and  that  the  inverse  equation 
be  stable.  This  condition  is  not  sufficient  however  except  in  the 
special  case  where  Z(t)  is  two  dimensional  and  (3.13a)  is  itself 
asymptotically  stable. 

Theorem  10s  If  (3.14)  is  absolutely  stable  then  either  CB  or 
GAB  is  nonzero  and  the  inverse  equation  is  stable. 

Proof;  Since  K  contains  all  the  functions  of  the  form  kx  where 
k  >  0  it  follows  that  if  we  can  show  that  if  CB  and  CAB  are  zero  or 
if  the  inverse  equation  is  unstable  then  there  exists  a  k  such  that 
the  equation  . 

Z(t)  =  AZ(t)  -  kB'CZ(t)  (3.15) 

in  unstable  we  will  have  established  the  proof.  From  the  remarks 
made  in  section  2.6  it  follows  that  (3.13)  implies  an  equation  of 
the  form 

(D-X^  (D-\g)  ...  (D-X  )y(t)  K  a(D-o'1))(D-cr2)  ...  (D-aq)x(t)  (3.16) 
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where  D  =  d/dt,  X^  are  the.  eigenvalues,  and  cn  are  the  eigenvalues 
of  the  inverse  equation.  Substituting  x  =  ky  =  k'y/a  we  have 

|D-X1)(D-X2)  ...  (D-Xp)  +  k«(D-oL)(D-CTg)  ...  (D-crJ  y  -  0  (3.18) 

Now  if  €B  and  CAB  are  zero  q  <  p-2.  As  is  known  from  the  theory  of 
the  root-locus  this  implies  that  for  large  values  of  k  the  equation 
(3.18)  becomes  unstable.  A  similar  conclusion  follows  if  any  of  the 
a  have  positive  real  parts.  Q.E.D. 

3.4  Conclusions 

A  very  important  problem  in  automatic  control  is  that  of 
evaluating  the  effects  of  feedback.  The  purpose  of  this  chapter  was 
to  explore  the  effects  of  feedback  in  terms  of  its  effects  on  the 
mapping  and  on  stability.  1  was  shown  that  a  certain  class  of  de¬ 
rivative  feedbacks  do  not  affect  irreducibility,  invertibility  and 
asymptotic  controllability,  provided  the  system  with  feedback  is 
stable. 

The  problem  of  evaluating  the  effect  of  feedback  on  stability 
was  discussed  and  the  problem  of  absolute  stability  was  mentioned. 

It  was  shown  that  for  single-input,  single-output  systems  stability 
of  the  inverse  equation  is  a  ncessary  condition  for  absolute 
stability  and  that  another  necessary  condition  is  that  either  CB, 


or  CAB  be  nonzero 


CHAPTER  IV 


TIME -OPTIMAL  CONTROLS 

4.1  Introduction 

In  spite  of  the  large  amount  of  research  which  has  been  de¬ 
voted  to  the  time -optimal  control  problem  a  number  of  basic 
questions  remain  unsolved.  In  particular  the  nature  of  the  time- 
optimal  forcing  function  for  a  single -input ,  single -out put  system 

which  has  a  transfer  function  with  a  nonconstant  numerator  is  still 

34  4  20 

not  well  understood  as  Lee  s  Athanassiades  and  Falb  }  and  Harvey 

have  all  pointed  out.  Because  systems  of  this  type  are  frequently 
encountered  in  practice  it  is  important  to  clarify  the  essential 
differences  between  this  problem  and  the  usual  problem  where  the 
objective  is  stated  in  terms  of  the  state  rather  than  the  output. 
Similar  problems  are  associated  with  multivariable  systems.  In 
this  case  the  difficulties  are  more  severe  and  even  the  term 
''numerator  dynamics''^  which  is  sometimes  used  to  describe  the 
troublesome  single  variable  cases ,  requires  a  careful  interpre¬ 
tation. 

The  problems  to  be  considered  here  are  similar  to  those  dis- 
20  21 

cussed  by  Harvey  and  Harvey  and  Lee  but  the  methods  to  be  used 
are  quite  different.  The  novelity  of  our  approach  centers  around 
the  use  of  the  inverse  equation  to  define  the  so  called  "target 
set"  and  the  use  of  frequency  domain  methods  to  enable  a  parametric 
representation  of  the  optimal  input.  The  result  is  that  the  time- 
optimal  problem  is  reduced  to  a  nonlinear  programming  problem  in 
which  no  differential  equations  appear.  One  objective  of  this  work 
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is  to  derive  the  general  form  of  this  programming  problem  and  to 

consider  some  example  problems. 

20 

As  Harvey  has  pointed  out,  when  the  objective  of  the  system 
is  expressed  in  terms  of  the  output  instead  of  the  state  it  is  im¬ 
portant  to  distinguish  between  several  different  statements  of  the 
optimization  problems  which  in  a  simpler  context  might  be  equivalent. 
A  further  objective  of  this  work  is  to  enlarge  upon  this  point  and 
to  suggest  appropriate  statements  for  the  optimization  problem  for 
three  important  cases. 

4.2  Problem  Formulation 

Consider  a  linear,  time -invariant,  system  defined  by  the 
equations 

Z(t)  =  AZ(t)  +  BX(t)  (4.1a) 

Y(t)  =  CZ(t)  (4.1b) 

and  assume  X(t)  and  X(t)  are  n-vectors  with  Z(t)  being  a  p-vector. 

One  of  the  simplest  statements  of  a  time-optimal  problem  is  this: 
Find  the  input  X  such  that  ]x^j  <  an  and  Z(t)  is  taken  from  the 
initial  state  Z(0)  to  a  desired  state  Z(t#)  in  minimum  time.  We 
shall  call  this  the  standard  problem.  Some  sufficient  conditions 
for  this  problem  to  have  a  unique  solution  can  be  found  in  reference 
44. 

For  many  purposes  it  is  more  realistic  to  state  the  objective 
in  terms  of  the  output  Y  rather  than  in  terms  of  the  state  as  is 
done  above.  The  reason  being,  of  course,  that  one  doesn't  usually 
care  what  the  value  of  the  state  is  provided  Y(t)  is  well  behaved. 
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If  this  point  of  view  is  adopted  it  is  necessary  to  distinguish  be¬ 
tween  several  alternative  statements  of  the  optimization  problem. 

To  simplify  the  discussion  only  the  case  where  the  desired  value  of 
Y(t)  is  zero  will  be  considered.  In  this  instance  the  following 
three  formulations  of  the  time-optimal  problem  have  areas  of 
applicability  and  in  general  have  different  solutions. 

(A)  Infinite  Time  Problem.  Given  the  system  (4.1),  the  class 
of  allowable  inputs  S  ,  and  the  initial  state  Z(0),  find  XeS  such 

X  X 

that  Y(t)  is  identically  zero  for  t*  <  t  and  t*  is  a  minimum. 

(B)  Finite  Time  Problem.  Given  the  system  (4.1) ,  the  class  of 
allowable  inputs  S^,  and  the  initial  state  X(t)  is  identically  zero 
for  t*  <  t  <  t'  and  t*  is  a  minimum. 

(C)  Conditional  Time  Problem.  Given  the  system  (4.JL),  the 
class  of  allowable  inputs  S^,  and  the  initial  state  Z(0),  find  XeS^ 
such  that  Y(t*) ,  Y^  (t*)  are  all  zero  and  t*  is  a  minimum. 

The  infinite  time  formulation  might  be  suitable  for  a  regular 
problem.  The  finite  time  formulation  is  appropriate  when  the 
duration  of  the  process  is  fixed  in  advance.  For  problems  where 
the  termination  time  depends  on  when  the  objective  is  reached,  such 
as  is  the  case  in  rendezvous  problems,  the  conditional  time  state¬ 
ment  should  be  used.  The  conditions  imposed  by  A,  B,  and  C 
represent  successively  weaker  requirements  on  Y  and  hence  it  is  to 
be  expected  that  for  a  fixed  Z(0)  the  response  time  associated  with 
A  will  be  greater  than  or  equal  to  the  response  time  associated  with 
B  and  that  it,  in  turn,  will  be  greater  than  or  equal  to  the  response 


-60- 


time  for  C.  Some  conditions  under  which  equality  holds  will  be 
discussed  below. 

4.3  Constraints  on  the  Optimal  Input 

Consider  again  the  standard  problem  as  defined  previously.  It 

has  been  known  for  some  time  that  if  there  exists  any  input  which 

takes  the  system  from  some  initial  state  Z(0)  to  a  final  state  Z(t*) 

44 

and  if  the  general  position  condition  is  satisfied,  then  there 
exists  a  unique  X  which  minimizes  the  transition  time.  Moreover, 
during  the  transition,  X(t)  assumes  limiting  values  only.  That  is, 
|x^|  =  au  for  all  0  <  t  <  t*.  Regardless  of  what  the  value  of  Z(t*) 
is  if  it  can  be  achieved  with  the  allowable  inputs,  the  shortest 
time  will  be  obtained  by  using  inputs  which  assume  limiting  values 
only. 

Now  return  to  the  problems  A,  B,  and  C.  We  may  reason  that 
there  is  associated  with  each  an  optimum  value  of  Z(t*),  i.e.,  a 
optimum  value  of  Z(t)  at  the  time  at  when  Y(t)  and  all  its  devia- 
tives  first  vanish.  From  the  results  cited  in  the  previous  para¬ 
graph  it  follows  that  prior  to  the  time  t#X(t)  is  limiting.  By 
solving  the  system  (4.1)  for  X(t)  in  terms  of  Y(t) ,  as  is  done  when 
computing  the  inverse  equation,  one  can  obtain  an  expression  for 
X(t)  when  t  exceeds  t*.  In  what  follows  we  will  present  a  complete 
argument  only  for  the  case  where  X  and  Y  are  scalrs  but  in  a  later 
section  we  indicate  how  the  results  may  be  extended.  We  also 
normalize  the  system  so  that  the  allowable  values  of  X  are  |x|<  1. 


Let  x  and  y  be  scalars  related  by  (4.1)  and  let  k-1  be  the 
least  nonnegative  interger  such  that  CA"^  is  nonzero.  The  inverse 
equation  is  then  given  by 

Z(t)  =  (A-B(CAk“1B)"1CAk)  Z(t)  +  B(CAk-1B)“Vk^  (t)  (4.2a) 

x(t)  =  -( CAk_1B)  _1CAkZ(t)  +  (CAk“1B)"1y<'k'*(t)  (4.2b) 

The  question  as  to  what  the  set  of  allowable  values  of  Z(t*)  is  in 
each  of  the  cases  Aj  and  C  can  now  be  answered.  Define  F  as 
A-B(CAk”1B)'  ^CAk.  The  permitted  values  of  Z(t*)  in  each  case  are 
defined  implicitly  by  the  relations 

(A)  i)  CAiZ(t*)  =  0  for  i  =  0,  1,  ...  k-1 

ii)  |(CAk  ^B)  ^CAkeF^Z(t#)  (  <  1  .for  t*  <  t  <  oo 

(B)  i)  CA^Z(t*)  *  0  for  i  =  0P  1*  ...  k-1 

ii)  [  (CAk~1B)"1CAkeFtZ(t*)t'  <  1  for  t*  <  t  <  t« 

(C)  i)  CA1Z(t#)  =0  for  i  *  0,  1,  ...  k-1 

ii)  | (GAk_1B)~1CAkZ(t*) |  <  1 

The  only  differences  between  problem  statements  A,  B,  and  C  are 

that  they  specify  different  restrictions  on  Z(t*).  In  situations 

where  the  restrictions  corresponding  to  the  different  formulations 

T  T  T 

coincide  the  problems  become  equivalent.  For  examples  if  ('C  ,k  C  } 

.o  A^kC^)  is  of  rank  Ps  as  is  the  case  if  the  system  is  irreduc¬ 
ible  and  k  =  p  -ls  then  condition  i)  in  A,  B,  and  C  demands  that 
Z(t*)  be  zero.  This  is  the  case  if  the  transfer  function  of  the 
system  has  a  constant  numerator.  For  such  systems  problems  A,  B,  and 
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C  are  all  equivalent.  If  k  K  p  -  2  then  condition  i)  in  each  case 
leaves  one  parameter  free  which  must  be  chosen  in  such  a  way  as  to 
satisfy  ii) .  It  is  easily  seen  that  if  the  one  nonzero  eigenvalue 
of  F  is  negative  then  the  problems  A,  B,  and  €  again  coincide.  If 
the  eigenvalue  is  positive  the  problems  will  in  general  have  quite 
different  solutions  (see  example  below). 

Other  comparisons  between  the  cases  may  be  made  by  making 
alternative  assumptions  on  the  inverse  equation.  The  important 
points  are  3  however3  that  from  the  problem  statement  and  the  inverse 
equation  it  is  possible  to  identify  the  admissible  values  of  Z(t*) 
and  that  the  inverse  equation  and  Z(t*)  completely  define  x(t)  when 
t  exceeds  t*„ 

What  remains  is  to  determine  both  the  best  value  of  Z(t*)  and 
also  the  best  x  for  arriving  at  Z(t*).  This  problem  is  very 
difficult  and  even  in  the  case  where  Z(t*)  is  fixed  there  is  no 
really  satisfactory  general  solution.  There  are  many  ways  to  obtain 
solutions  for  particular  problems,  however,  and  what  we  will  do  here 
is  to  show  that  a  simple  and  effective  procedure  is  to  reduce  the 
problem  to  one  in  nonlinear  programming.  The  basic  idea  is  to  ob¬ 
tain  a  representation  for  the  Laplace  transform  of  x  which  contains 
undetermined  coefficients  and  then  to  derive  constraints  on  these 
coefficients.  The  end  result  is  an  ordinary  extremi zation  problem 

involving  both  nonlinear  and  inequality  constraints.  In  this  re- 

22 

spect  our  approach  is  similar  to  that  of  Ho  .  . 

From  the  maximum  principle  it  follows  that  until  Z(t#)  is 
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reached  x(t)  is  limiting.  After  the  point  Z(t#)  is  reached  x(t)  is 
given  in  terms  of  Z(t#)  as  Ge^Z(t#)  where  G  =  (CA^  ^B)  ^CA^„  Thus 
the  Laplace  transform  of  x(t)^  which  will  be  written  as  x(s)5  can  be 
expressed  as 

x(s)  -  -  (1  +  2  2  (-DW  -(-l)net#S(l  «■  G(Is-F)-1Z(t*))) 

1-1  (4.3) 

where  the  t^  are  restricted  by  the  inequalities 

°  <  <  tg  *”»  tn  <  t*  (4.4) 

and  Z(t*)  must  satisfy  the  restrictions  defined  above,  depending 

on  the  problem  statement.  The  following  lemma  provides  the  basis 

for  the  remaining  restrictions  which  are  to  be  imposed. 

Lemma  5s  Suppose  y(t)  is  uniformly  bounded  on  [0,00^,  Then 

a  necessary  and  sufficient  condition  for  y(t)  to  vanish  identically 

for  t  >  t*  is  that  y(s)  have  no  poles  in  the  finite  part  of  the " 

plane  and  that  lim  jy(s)eS^[  =  0  for  t  >  t*. 

Re  ( s )  —  >-00 

Proof  %  Assume  y(t)  is  uniformly  bounded  on  [o,ooj  by  m  and 
assume  y(t)  vanishes  for  t  >  t*.  Then  [y(s)f  is  given  by 

|y(s)  |  =  [  <  mf  (l-e-st*)/s|  (4.5) 

Since  m(l-e  )/s  has  no  nonremovable  poles  in  the  finite  part  of 

the  plane  and  since  lim  es^m[ (l-e  s^*)/s[  =  0  for  t  >  t*  we  see 
Re(s)— ?>-co 

that  the  conditions  given  are  necessary.  To  illustrate  sufficiency 
it  is  necessary  to  recall  that  y(t)  is  given  by 
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y(t)  =  J  esty(s)ds  (4.6) 

where  p  can  be  taken  as  the  infinite  seimicircle  which  closes  on  the 
left  and  has  as  a  diameter  the  line  Re(s)  -  <7,  provided 

y(cr)  -  f°°e  CTty(t)dt  <  oo  (4.7) 

Jo 

But  since  y(t)  is  assumed  to  be  uniformly  bounded,  cr  can  be 

S"t“” 

taken  to  be  any  real  positive  number.  Because  e  y(s)  has  no  poles 

anywhere  in  the  finite  plane  and  because  lim  |eS^y(s)|  is  finite 

Re(s)  ->-oo 

for  t  >  t*  it  follows  that  the  integral  in  (4.6)  is  zero  if  t  >  t*. 

Q.E.D. 

From  this  lemma  it  follows  that  a  necessary  and  sufficient  con¬ 
dition  for  y(t)  to  vanish  for  t  >  t*  is  that 

(s-Xi)[C(Is-A)"1Bx(s)  -  C(Is-A)"1Z(0)]s=x  =  0  (4.8) 

i 

where  are  the  eigenvalues  of  A,  and  hence  the  poles  of  (Is-A)  \ 
and  x(s)  is  assumed  to  have  the  form  given  by  equation  (4.3).  Using 
(4.3)  in  (4.8)  it  follows  that  the  t^  should  be  selected  so  that  t* 
is  a  minimum  and 

-  C(ls-A)~1B(l+2  2  (-l)1e"Sti-(-l)ne~st*(HG(Is-F)"1Z(t*))Js=x  - 

i=l  i 

-  !C(Is-A)_1Z(0)Js=x  (4.9) 

l 

This  equation,  together  with  the  requirement  (4.4)  and  the  con¬ 
ditions  on  Z(t*)  imposed  by  the  problem  statement,  completely 
defines  the  solution  to  the  time-optimal  problem.  In  general  both 
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the  and  Z(t*)  are  unknown  and  must  be  selected  in  such  a  way  as 
to  minimize  t*.  Equation  (4.9)  provides  a  total  of  p  equations  so 
that  in  general  one  would  expect  that  if  Z(t*)  can  vary  over  a 
a-parameter  fmaily  then  n  will  have  to  be  at  least  p-a-1  but  there 
is  no  guarantee  that  this  number  is  large  enough  to  ensure  that  the 
optimum  solution  will  be  achieved. 


4,4  Example  s 

To  illustrate  the  application  of  these  ideas  and  to  indicate 

the  types  of  solutions  one  may  expect  we  now  consider  two  examples. 

The  first  of  these  is  a  second  order  system  which  has  been  examined, 

4 

but  not  solved,  by  Auhanassiades  and  Falb  .  The  equations  of  motion 
21  r© 

k(t)|  |o  i  |  k(t)|  |i 

x(t)  (4.10a) 


- 1 

C+- 

'-'.1 

0 

i 

zi(t> 

+ 

[i 

z?(t) 
L  *  _ 

-Sc 

-3 

0 

y(t)  =[l  0] 


zl^ 

z2(t) 


(4.10b) 


and  the  magnitude  of  x(t)  is  restricted  to  be  less  than  or  equal  to 
one.  These  equations  define  a  system  those  transfer  function  is 
(s+3)/(s+l)(s+2) . 

Our  objective  is  to  find  the  optimal  x(t)  as  a  function  of  the 
initial  values.  As  a  first  step  we  compute  the  inverse  equation  as 
given  by  (4.2)  to  get 
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(4.11a) 


x(t)  »-jp  l]  z^t)  +  y(t)  (4.11b) 

_Z2(t)_ 

Since  the  essential  part  of  the  inverse  equation  is  stable  and 
is  of  first  order  it  follows  that  problems  A,  B,  and  C  are  all 
equivalent.  Moreover ,  by  setting  y(t)  equal  to  zero  it  follows 
that  x  =  z^(t*)e  for  t  >  t*.  Therefore  |zg(t#)|  <  1  is  the  re¬ 
quirement  on  Zg(t*). 

As  noted  above,  in  general  it  will  not  be  known  in  advance  what 
the  optimum  number  of  switches  is.  However,  for  this  problem  it  is 
known  to  be  zero  or  one  because  the  eigenvalues  are  real  and 
negative.  Because  of  this  x(s)  can  be  written  as 

x(s)  =  -  (l-2e  Stl  +e  st*(l+as/(s+3) ) )/s  (4.12a) 

°  <  ^  <  t*  ;  |  a|  <  1  (4.12b) 

Let  z^(0)  =  u  and  z^O)  *  v„  The  constraints  of  equation 
(4.9)  can  be  written  as 

(s+3)(l-2e  S^1  +eS^*(l+as/(s+3) ) )/s-(s+3)u+vJ g  _  0  (4.13) 

■fc 

By  letting  r^  =  e  1  and  r,,  =  e  the  entire  problem  can  be  expressed 

~~ 

The  switches  are  counted  as  in  reference  44.  That  is  the  number  of 
switches  equals  one  minus  the  number  of  distinct  intervals  over 
which  x  is  a  constant. 
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succintly  as  follows s  Minimize  r^  subject  to  the  contraints 

2rl  r2^"a/2)  =  1  “  (u  +  v/2)  (4.14a) 

Zt1Z~  r^1_2a)  =  1  -  (2u  +  2v)  (4.14b) 

j'aj  <1  $  1  <  <  r^  (4.14c) 

This  is  a  nonlinear  programming  problem  and  although  not  a 
great  deal  is  available  in  the  way  of  a  general  theory^,  many 
problems  of  this  type  can  be  solved  either  by  hand  or  on  a  computer. 
This  particular  problem  can  be  solved  to  give  r^  and  r,,  as  a 
function  of  u  and  v,  but  the  solution  is  rather  involved.  For  con¬ 
trol  purposes  it  is  more  important  to  be  able  to  determine  the  input 

as  a  function  of  Z(t).  For  this  example  this  information  is  con¬ 
tained  in  Fig,  4.1.  For  values  of  Z(t)  lying  to  the  right  of  the 
switch  curve  X(t)  should  be  -1,  for  values  lying  to  the  left,  X(t) 
should  be  +1. 

Fig.  4.1  also  shows  a  typical  optimum  response  curve.  The  form 
of  the  x  producing  it  is  shown  in  Fig.  4.2.  For  the  particular 
initial  values  chosen  the  optimum  input  only  assumes  one  limiting 
value.  This  will  be  the  case  whenever  Z(0)  is  in  the  shaded  region 
shown  in  Fig.  1,  Otherwise  the  optimum  X(t)  will  take  on  values  of 
both  +1  and  -1.  It  is  interesting  to  observe  that  if  jZ(0)|  is 
sufficiently  small  then  no  switches  are  required  and  even  if  J Z (0) | 
gets  arbitrarily  large  most  initial  values  still  require  no  switches. 
This  is  in  sharp  contrast  with  the  second  order  system  with  con¬ 
stant  numerator  and  real  roots  which  requires  one  switch  everywhere 


Figure  1:  The  switching  curve  for  time -optimal  con¬ 
trol  of  a  system  whose  transfer  function  is  (s+3)/ 
(s+l)£s+2).  The  equation  for  the  switch  curve  is  z2  = 
-22^  -  (1  +  V 6 z ^ ) .  The  optimum  response  from  (2,-3J  is 
shown  and  the  zero-switch  region  is  shaded .  It  is 
bounded  by  the  curve  z2  =  -2z^  -  (3  -  V 4+2z  ^ ) . 


- — I - n - - - -  i  \  r  ’  I  l  * 

i.  X  3  4 -  t 

-1 - 

Figure  2:  The  optimal  input  for  the  system  of  figure  1. 
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ex  cept  on  the  switch  curve. 

As  an  example  of  a  system  for  which  problems  Aj  B,  and  €  have 
different  solutions  consider  the  second  order  system 


zl(t) 

. 

0  1 

^(t) 

-1 

Z2(t> 

-2  -3 

_Z2(t)_ 

4 

y(t) 

-( 

Lo  i„ 

zl(t) 

z2(t) 

x(t) 


(4.15a) 

(4.15b) 


This  system  has  a  transfer  function  of  (l-s)/(l+s)  (2+s)  „  Its  in¬ 
verse  equation  is 
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1 

_z2(t> 
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x(t) 
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[o  l] 

zl(t> 
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y(t) 


Z2(t) 


(4.14a) 

(4.16b) 


and  is  unstable.  If  y(t)  vanishes  identically  for  t  >  t*  then  z^(t) 
=  0  and  x(t)  =  Zg(t*)et  for  t  >  t*.  From  this  it  follows  that  in 
all  cases  z  (t*)  =  0.  For  problem  A  z^(t*)  must  vanish*  for  problem 
Bjzg(t#)j  must  be  less  than  or  equal  to  ln(t’-t*)*  while  for  problem 
C  it  is  merely  required  that  | z^(t*) |  be  less  than  or  equal  to  one. 

The  constraint  equations  can  be  set  up  as  before.  The  major 
difference  is  that  for  problem  A  the  solution  is  considerably 
easier  due  to  the  vanishing  of  z,,(t*) .  The  switching  curves  for 
problems  A  and  C  are  shown  in  Fig.  4.3  and  4.5,  Again  those  regions 
for  which  x(t)  assumes  only  one  limiting  value  have  been  shaded. 


Figure  4:  The  optimal  input  for  the  system  of  figure  3. 


Figure  5:  The  time --optimal  switching  curve  for  a 
system  whose  transfer  function  is  ( 1 -s) / ( s+1) ( s+2) , 
problem  C.  The  equation  for  the  switching  curve  is  z 
-2z^  -  (1  -  l/2z^  .  The  optimum  response  from  (l,-3.5 

is  shown  and  the  zero  switch  region  is  shaded. _ It  is 

bounded  by  the  curve  z2  =  -2z^  -  1/5(1- V  36-90 z^) 


X(t) 

1 


-1 

Figuye  6:  The  optimal  input  for  the  system  of  figure 
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Figures  4.3  and  4.5  also  show  the  form  of  the  optimum  response 
from  the  initial  point  (l,-3.5).  As  would  be  expected  the  response 
time  for  problem  A  is  greater  and  the  optimum  inputs  in  the  two 
cases  have  quite  different  forms. 

For  comparison  the  switch  curve  of  l/(s+l)(s+2)  is  given  in 
Fig.  (4.7).  For  such  a  system  x(s)  always  assumes  both  limiting 
values  unless  Z(0)  lies  on  the  switch  curve.  Evidently  if  the 
inverse  equation  is  unstable  then  the  formulation  given  by  A  yields 
a  switching  policy  is  quite  similar  to  that  which  is  obtained  for 
systems  having  a  constant  numerator. 

One  point  which  should  be  emphasized  is  that  formulations  A 
and  B  do  not  have  optimum  inputs  which  can  be  generated  by  ideal 
relays.  While  it  is  true  that  the  input  is  limiting  up  until 
y(t)  s  0  from  this  point  on  x(t)  is  generated  by  the  inverse 
equation.  Thus  the  optimal  controller  must  include  the  inverse 
system  as  indicated  in  Fig.  (4.8). 

4.5  More  General  Systems 

The  basic  approach  here  has  been  to  express  the  optimum  X  in 
terms  of  undetermined  coefficients  and  then  to  determine  these 
coefficients  in  such  a  way  as  to  minimize  the  response  time.  The 
inverse  equation  was  used  to  help  parametrize  the  optimum  input. 

Now  consider  an  n -input s  n-output  system  which  has  a  transfer  matrix 
P(s).  It  is  obvious  from  lemma  3  that  if  Xg(s)  =^°°e  S"^X(t)dt 

J  t -if- 

then  P(s)Xg(s)  can  have  poles  only  where  P(s)  has  poles.  This 
condition  is  equivalent  to  saying  that  X.,(t)  must  be  generated  from 


Figure  7:  The  time -optimal  switching  curve  for 
system  whose  transfer  function  is  l/( s+l^( s+2]^ 
equation  for  the  switch  curve  z2  =  -2z-^  -  \j 2z^ 


a 

The 


Figure  8:  The  structure  of  the  optimal  open-loop  con¬ 
troller  showing  the  role  of  the  inverse  system. 
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the  inverse  equation. 

This  type  of  constraint  also  arises  in  solving  least-squares 
1  8  50  8 

problems  5  J  .It  has  been  shown  that  Xg(s)  can  have  poles  only 
at  points  where  det  P(s)  =  0  and  hence  if  det  P(s)  has  no  zeros  we 
are  dealing  with  the  multivariable  equivalent  of  the  case  where  the 
numerator  of  the  transfer  function  is  a  constant.  For  such  a  system 
problems  A,  B,  and  G  are  all  equivalent  and  Xg(t)  s  0.  It  should  be 
observed,  however,  that  the  condition  that  det  P(s)  has  no  zeros  is 
a  very  special  one  and  therefore  the  distinction  between  problems 
A,  Bs  and  G  is  very  likely  to  be  a  significant  one  when  dealing 
with  multivariable  systems. 

4.6  Conclusion 

In  this  chapter  it  has  been  shown  how  the  inverse  equation  can 
be  used  to  advantage  in  solving  certain  computational  problems 
associated  with  time-optimal  control  and  it  has  been  shown  by  the 
structure  of  the  inverse  equation  determines  the  structure  of  the 
time-optimal  input.  Several  alternative  statements  of  the  time- 
optimal  problem  have  been  proposed  and  their  differences  and 
similarities  have  been  related  to  the  inverse  equation.  Examples 
dealing  with  single-input,  single-output  systems  have  been  given 
and  an  extension  to  the  multivariable  case  has  been  indicated. 


CHAPTER  V 


CONCLUSIONS  AND  EXTENSIONS 


5.1  Summary 

The  principle  objective  of  this  research  was  to  develop  a  means 
for  characterizing  multi -input.,  multi -output  systems  whose  behavior 
is  governed  by  differential  equations.  In  particular,  it  was  de¬ 
sired  to  obtain  a  characterization  which  would  accurately  reflect 
those  properties  of  such  systems  which  are  of  interest  in  automatic 
control.  With  this  in  mind  it  was  shown  how  a  system  could  be 
viewed  as  a  mapping  whose  domain  is  a  set  of  functions  and  whose 
range  may  be  either  a  set  of  functions  or  a  set  of  equivalence 
classes.  By  characterizing  the  response  space  in  various  ways  it 
was  possible  to  interpret  the  familiar  properties  of  a  system  such 
as  stability  and  minimum  phasity  in  terms  of  the  continuity  of  these 
mappings  and  new  properties  of  interest  were  defined  as  well. 

It  was  shown,  for  example,  that  the  inverse  equation  not  only 
provides  a  means  for  studying  the  continuity  of  the  inverse  mapping 
but  also  is  of  importance  in  finding  the  solution  to  optimal  control 
problems.  The  concept  of  nonsingularity  as  usually  applied  to 
transfer  matrices  was  applied  to  systems  which  are  represented  in 
differential  equation  form.  The  problem  of  asymptotic  controll¬ 
ability  was  posed  and  the  conditions  for  asymptotic  controllability 
were  expressed  both  in  terms  of  differential  equation  representa¬ 
tions  and  in  terms  of  transfer  matrix  representations. 
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Because  of  the  practical  importance  of  feedback  an  effort  was 
made  to  determine  what  effect  it  has  on  a  given  transformation.  It 
was  shown  that  although  derivative  feedback  can  affect  stability  it 
cannot  make  a  singular  system  nonsingular  nor  can  it  affect  its 
irreducibility  or  asymptotic  controllability  provided  it  is  of 
sufficiently  low  order. 

Throughout,  the  importance  of  the  inverse  equation  has  been 
emphasized.  In  Chapter  IV  it  has  been  shown  how  the  inverse 
equation  can  be  used  to  help  solve  certain  computational  problems 
associated  with  time-optimal  control  and  examples  have  been  given 
to  illustrate  this  idea. 

5.2  Future  Research 

In  view  of  the  importance  of  the  inverse  equation  it  would  be 
desirable  to  have  an  expression  for  it  which  would  encompass  all 
linear  time -in variant  systems.  On  the  bases  of  the  work  done  in 
Chapter  II  it  is  clear  how  one  should  proceed  to  construct  the  in¬ 
verse  equation  for  any  given  linear  system  but  no  completely 
general  formula  was  given.  Similarly,  it  would  be  convenient  to 
have  an  explicit  formula  for  a  matrix  whose  eigenvalues  correspond 
to  those  of  the  inverse  equation.  An  extension  of  these  ideas  to 
time-varying  linear  systems  would  also  be  of  interest. 

Certain  aspects  of  asymptotic  controllability  are  very  in¬ 
teresting  and  merit  further  study.  For  example  in  Chapter  I  it  was 
shown  that  there  exist  single -input ,  multi-output  systems  which 
are  asymptotically  controllable.  Clearly  this  type  of  system  cannot 
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be  accurately  represented  by  a  time-invariant  linear  model.  The 
questions  of  how  often  systems  of  this  type  occur  in  practice  and 
how  such  systems  should  be  modeled  would  seem  to  be  of  considerable 
importance . 

The  formalism  used  here  also  can  be  used  to  define  a  measure  of 
the  accuracy  of  a  model.  For  example  if  a  physical  system  maps  X 
into  Y  and  if  a  proposed  model  of  the  system  maps  X  into  then  a 
measure  of  the  accuracy  of  the  model  is  given  by 

T»'  S  l|M”u  (5-1> 

x 

where  the  norm  may  be  chosen  in  any  way  so  as  to  reflect  those 

properties  of  the  system  which  are  of  particular  interest.  The 

problem  of  defining  the  type  of  interaction  measures  suggested  by 
41 

Mesarovic  can  be  treated  in  a  similar  way. 
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